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(57) Abstract: Provided herein are combinatorial containing chimeric Major Histocompatibility Comples (MHC) Class I proteins 
displayed on the surfaces of recombinant yeast cells. Members of the libraries, especially where those libraries have been mutage- 
nized either with error-prone Polymerase Cahin Reaction or with site-directed oligonucleotide mutagenis, are improved in confor- 
mation stability or in binding to a target, e.g., a peptide or other ligands as compared with the stability or binding affinity of a wild 
type MHC Class I chimeric protein. The improved mutant chimeric proteins can be selected by various means, including fluores- 
cence activated cell storing with a fluorescent ligand bound to the surfaces of the yeast cells displaying the improved mutant chimeric 
protein. 



WO 02/46399 



PCT7US01/47817 



MUTATED CLASS I MAJOR HISTOCOMPATIBILITY PROTEINS AND COMPLEXES 

CROSS REFERENCE TO RELATED APPLICATIONS 
This application is a Continuation-in-Part of United States application No. 60/254,495, 
filed December 8, 2000. 
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This invention was made, at least in part, with funding from the National Institutes of 
Health (Grant No. PHS 5 ROl AI35990). Accordingly, the United States Government has 
certain rights in this invention. 

BACKGROUND OF THE INVENTION 
The field of the present invention is molecular biology, in particular, as it is related to 

combinatorial libraries of immune cell proteins displayed on th* rail cnrfrre nf a r p^/> rn ^j nan f 

host cell. More specifically, the present invention relates to a library of major 
histocompatibility locus proteins displayed on the surfaces of recombinant yeast cells, to 
mutant MHC Class I and proteins selected for improved binding to particular target peptides, 
to mutant MHC Class I proteins selected for binding to a particular antigen, to MHC Class I 
proteins of improved stability and to the use of the selected high affinity and/or more stable 
MHC derivatives in diagnostic methods and imaging assays, among other applications 
including prophylactic and therapeutic treatments. 

Proteins encoded by the major histocompatibility complex (called MHC proteins) are 
requisite components of the antigenic complexes that are involved in many diseases . These 
diseases include cases where the body reacts with one's own MHC proteins (in various 
autoimmune diseases)^infecSous^iseases and cancer, wherelhe MHC^emticalmbrnding 
and presenting foreign, antigenic peptides. In this invention, MHC proteins of the class I type 
were expressed as heterologous, surface-linked fusions on yeast cells with the goal of 
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generating improved MHC proteins. Libraries of mutant MHC and mutant peptide-MHC 
complexes could be screened for higher surface levels in order to identify variants that 
exhibited improved properties, including enhanced stability. For the first time, this system 
allows the directed evolution of MHC molecules that represent novel agents for various 
diagnostic and therapeutic applications. These agents could be used in cancer, infectious 
diseases (e.g., virus infections), and autoimmune diseases (e.g., multiple sclerosis, type I 
diabetes, rheumatoid arthritis). 

» 

T cell receptors (TCRs) and antibodies have evolved to recognize different classes of 
ligands. Antibodies function as membrane-bound and soluble proteins that bind to soluble 
antigens, whereas in nature, TCRs function only as membrane-bound molecules that bind to 

cell-associated peptide/MHC antigens. All of the energy of the antibody : antigen interaction 

> * 

i * • 

focuses on the foreign antigen, whereas a substantial fraction of the energy of the 
TCR:peptide/MHC interaction seems to be directed at the self-MHC molecule [Manning et al. 
(1998) Immunity 8:413 :425] . In addition, antibodies can have ligand-binding affinities that are 
orders of magnitude higher than those of TCRs, largely because of the processes of somatic 
mutation and affinity maturation. In their normal cellular context, TCRs do not iindergo 
somatic mutation, and the processes of thymic selection seem to operate by maintaining a 
narrow window of affinities [Alam et al. (1996) Nature 381:616-620]. The association of 
TCRs at the cell surface with the accessory molecules CD4 or CD8 also may influence the 
functional affinity of TCRs [Garcia et al. (1996) Nature 384:577-581]. Despite these 
differences, the three-dimensional structures of the two proteins are remarkably similar, with 
the hypervariable regions forming loops on a single face of the molecule that contacts the 
antigen. 

« 

■ 

There is a long felt need in the art for Class I MHC proteins and Class I MHC/peptide 
complexes with improved stability and/or with improved T cell regulatory properties. Such 
improved Class I MHC proteins or complexes are useful in activating T cells that participate 
in the removal of target cells including neoplastic cells and cells infected with pathogenic 
agents_mcludmg 3 _butjiotJU^ 
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improved Class I MHC proteins and complexes of the present invention are also improved for 
use as research tools. 

SUMMARY OF THE INVENTION 
5 The present invention provides combinatorial libraries of Class I MHC proteins 

displayed on the surfaces of recombinant host cells, for example, yeast cells, desirably, 
Saccharomyces cerevisiae. From such a library can be isolated mutant MHC proteins that 
exhibit a relatively high affinity for a peptide ligand of interest. Also within the scope of the 
present invention are methods for isolating mutant Class I MHC proteins with improved 
10 stability, especially as MHC/peptide complexes of improved stability. 

The present invention further provides Class I MHC/peptide complexes and proteins 
that exhibit increased stability over the wild type Class I MHC/peptide complex or MHC 
protein, and which MHC proteins exhibit high affinity for a peptide ligand interest. This 
15 ligand can be a peptide, a protein, a carbohydrate moiety, or a lipid moiety, among others. 

Suitable labels allowing for detection of a ligand bound to an MHC protein, directly or 
indirectly , include but are not limited to fluorescent compounds , chemUuminescent compounds , 
radioisotopes, chromophores, and others. The stable, MHC protein of the present invention, 
20 where it specifically binds to a tumor cell antigen with high affinity and specificity can be used 
in diagnostic tests to detect T cells that are specific for the Class I MHC/peptide complex. The 
Class I MHC/peptide complexes of the present invention can also be used to activate T cells 
and thus, to enhance an immune response to an antigen or target cell of interest. 

■ 

25 Also provided by the invention are novel yeast display vectors and surface expression 

constructs in which the portion of the fusion protein that mediates attachment to the cell surface 
(the AGA2 sequence) is located downstream, or at the carboxy end of the protein/sequence of 
interest. See Fig. 2. 
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BRIEF DESCRIPTION OF THE DRAWINGS 
Figure 1 provides a ribbon diagram corresponding to the crystal structure of a class I 
major histocompatibility complex with highlighted subunits. The MHC is trimeric. The MHC 
K b a chain ("350 amino acids) binds to an 8-10 amino acid residue peptide in the peptide 
binding cleft. The p 2 microglobulin (99 amino acids) associates non-covalently with the a 



chain. 



10 



Figure 2 illustrates the displayed MHC protein of interest displayed on the yeast cell 
surface via a disulfide linkage through the AGA2 portion of the fusion protein comprising the 
MHC component. 



15 



Figure 3 illustrates diagrammaticaUy the pCT302 yeast surface display vector which 
contains a sequence encoding AGA2/HA-Class I MHC - c-myc fusion protein. This fusion 
protein coding sequence is expressed in yeast under the regulatory control of the GAL1-10 
promoter. A similar vector, pYDl, is commercially available from Invitrogen. 

Figure 4 provides diagrams of various single-chain Class I MHC constructs cloned into 
the pCT302 Yeast Display Vector. 



20 
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Figure 5 provides the results of flow cytometric analyses of various K b constructions. 
Yeast cells displaying K b /p2m, SIYR K b /p2m, dEV8 K b /b2m, and OVA K b /p2m were stained 
with antnwwyc Mab 9E10 and biotinylated anti-K b antibody (B.8.24.3) Mowed by FITC 
labeled F(ab') 2 goat anti-mouse IgG or SA-PE (shaded peaks). For a background control, 
yeast were treated with only the secondary stain (unshaded peak). Labeled yeast were analyzed 
on Coulter Epics XL flow cytometer. All four constructions displayed the properly folded K b 
molecule on the surface. 



30 



Figure 6A-6D illustrate histograms from CD69 up-regulation assays. Induction of T 
cell activation marker CD69 on naive splenocytes (10 6 ) in the presence of K b /p2m, anti-TCR 
-ScFv,.and.SIYR/K b /p2m-yeastcells^-Induced-yeast cells (10^ weremcubatedat37 0 C-ana"5% 
C0 2 with splenocytes from 2C TCR/RAG-r'- mice. After 20 hours in culture, cells were 
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harvested, washed in PBS/0.5% BSA and stained with biotinylated clonotypic antibody, 1B2 
PCranz et al. (1984) Proc. Natl. Acad. Sri. USA 81: 7922-7926] for 45 minutes. These yeast were 
washed and stained with a mixture of the early activation marker FITC anti-CD69 antibody and 
SA:PE. The yeast/T cell mixture was analyzed for bound FTTC-labeled anti-CD69 antibody 
by flow cytometry, gating on 1B2 positive T cells. The mean fluorescence units for FITC anti- 
CD69 antibody in the absence of yeast, presence of K b /b2m yeast, anti-TCR scFv yeast, and 
SIYR/K b /(J2m yeast are indicated and illustrated in the histograms. Yeast cells mat expressed 
K b with the agonist peptide (called "SIYR"; SIYRYYGL SEQ ID NO:l), but not yeast cells 
with K b and no peptide, were capable up-regulating the CD69 molecule on the 2C T cells. 
This up-regulation was even greater man observed with the positive control anti-TCR antibody 
KJ16 [Cho et al. (1998) /. Immunol. Methods 220(1-2): 179-188]. 
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Figure 7 shows the results of direct activation of T cells by yeast that express Class I 
Peptide/MHC, as measured by flow cytometry. Up-regulation on2C TCR/RAG 7 " splenocytes 
incubated with varying ratios of yeast (3 to 100 x 10 5 yeast and 10 6 T cells) were analyzed by 
flow cytometry. The mean fluorescence units of FTTC anti-CD69 antibody was detected on 
1B2 gated T cells following 20 hours of incubation (37°C, 5% COj) of 2C TCR/RAG"'" 
splenocytes with yeast that expressed: SIYR/K b /p2m , anti-TCR scFv (positive control), and 
K b /p2m (negative control). 

Figure 8 shows direct activation of T cells by pMHC on yeast, thus providing evidence 
for pMHC specificity excess peptide and anti-K b antibody inhibit CD69 up-regulation. In order 
to confirm the specificity of T cell activation, K b /p2m (negative control), SIYR/K b /p2m 
bearing yeast cells, and 2C TCR/RAG"'* splenocytes were incubated with excess OVA 
(SHNFEKL, SEQ ID NO:2) peptide or anti-K b antibody (50 Mg/ml B.8.24.3) at37°C, 5%C0 2 . 
Excess OVA peptide binds to K b and should compete for binding to the K b molecule, but it is 
not recognized by 2C T cells. Anti-K b antibody recognizes the al/a2 helices and should 
prevent binding by the T cell receptor from 2C T cells. After 1 hour, 2C TCR/RAG" 7 " 
splenocytes were mixed with the yeast and incubated for 20 hrs. Detection of CD69 on 2C T 
-ceUsfoHowing-incubation-withyeast-in-the absence or-presenGeof-inhibitors-wasdeteeted-with- 
FUC anti-CD69 antibody and reported as the mean fluorescence units. Both excess OVA 
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peptide and tie anti-K b antibody inhibited the up-regulation of CD69, confirming that the 
recognition of the yeast-bound SIYR/K b was specific. 

Figure 9 diagrammatically illustrates cloning and transformation via homologous 
recombination. Linear mutagenized MHC protein coding sequence and linearized pCT302 
vector are co-electroporated into yeast. This strategy is used in the preparation of the yeast 
display library for the MHC Class I protein. 
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Figure 10 illustrates the strategy for mutagenizing a yeast display library and screening 
that library by taking advantage of the binding properties of mutant vs wild type displayed 
proteins. Fluorescenfly labeled ligands or antibodies which bind the displayed protein are 
employed in flow cytometry assays. 



Figure 11 provides examples of sorting libraries generated by random mutagenesis. 
1 5 A SrYR/K b /p2m error prone library (SEP) and a dEV8/K b /p2m error prone library (dEP) were 
each sorted 3 times. The SEP and dEP yeast (10 7 ), after having been induced for 2 days, were 
washed with PBS/0.5% BSA, stained with biotinylated anti-K" (B.8.24.3) for 60 min, washed 
again, and stained with SA:PE for 30 min. The first sort isolated the top 1.0% of the 
population using a 1:25 dilution of anti-K b . The second and third sorts were stained with a 
1:250 and 1:500 dilution of anti-K b for SEP and 1:250 dilution for both the second and third 
sort of dEP. The second and third sorts isolated the top 0.25% and 0.1 % of the population, 
respectively. Representative histograms from the second and third sort of the SEP library 
illustrate the mean fluorescence shift of the total population or enhancement of the more 
fluorescent yeast cells from sort 2 to sort 3. This indicated that there were likely to be K b 
mutants that exhibited increased stability and hence, increased surface levels [Shusta, (1999) 
/. Mol. Biol. 292: 949-956]. 
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Figure 12 shows the K b surface levels of mutant clones isolated by sorting from 
dEV8/K b error prone PCR library. Following sorting, ten randomly selected clones from the 
dEV8/K b /|^mjibraryj^e_a^ 
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K b antibody (B.8.24.3). Binding was detected with SA:PE, and mean fluorescence for each 
clone was detennined. 
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Figure 13 shows sequences of mutant clones isolated by sorting from dEVS/K" error 
prone PCR library. The mutations in three clones selected from the dEV8/K b /p2m error prone 
library (sort 3) and the crystal structure with two of the mutations (W167R, Y63N) are shown. 

Figure 14 illustrates construction of a directed, mutagenic peptide library in dEV8/K b 
library construction. Mutagenic PCR of positions 1-3 of the peptide dEV8 was performed 
using the degenerate upstream primer (5' ATA TTT TCT GTT ATT GCT TCA GTT TTA 
GCA GCT AGC TTG GAT AAA AGA NNS NNS NNS AAA TTC 3\ SEQ ID NO:3) and 
a downstream vector-specific primer. Using homologous recombination, the mutagenic 
dEV8/K b /p2m PCR product and Nhe l-Nde I digested dEV8/K b /p2m pCT302 were 
electroporated into electrocompetent yeast (BY5465, strain EBY100). A dilution of the 
transformed library was plated on SD-CAA plates and incubated for 3 days at 30°C to obtain 
the library size (7 x 10 3 ). 



Figure 15 shows the properly folded «3 domain, but the al/«2 domains of L d are not 
displayed on yeast. L J /p2m yeast cells were pulsed for two hours with the L d -binding peptide 

20 QL9 (QLSPFPFDL, SEQ ID NO:4), washed whh PBS/0.5%BSA, and stained with anti-L d 
antibodies, 28. 14.8 (a3 specific) or 30-5-7 (al/oc2 specific). After 45 min, cells were washed 
and stained with FITC-labeled F(ab=) 2 goat anti-mouse IgG, and binding was detected using 
flow cytometry. The histograms generated from the L d /p2m displaying yeast represent the 
binding of the «3 specific antibody (28.14.8). The 30-5-7 antibody, ctl/a2 specific, did not 

25 show the same binding (histograms not included), suggesting that at least one of these two 
domains was not folded properly and thus might be stabilized by a process of directed 
evolution and yeast display. 



30 



Figure 16 illustrates the result of sorting a yeast display mutant library of "unstable" 
Cl ass I MHC L d . Yeast cells displaying L d /P2m were induced at 2 0 C for 1 days in rii» 
presence of an L d /p2m specific peptide, QLSPFPFDL (QL9, SEQ ID NO:4). L d /p2m yeast 
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were stained with supernatants from D. melanogaster cells expressing the high-affinity T cell 
receptor (m6), biotinylated anti-T cell receptor antibody (F23.1) and a streptavidin- 
phycoerythrin (SA:PE) conjugate (shaded peak). The L d /p2m yeast cells were also stained 
with an <x2 domain specific antibody (30-5-7) and an a3 domain specific antibody (28.14.8) 
followed by a FITC labeled F(ab') 2 goat anti-mouse IgG (shaded peak). For a background 
control, yeast cells were treated with only the secondary stains (unshaded peaks). Labeled 
yeast cells were detected using a Coulter EPICS XL flow cytometer. The L d /p2m a3 domain 
was folded properly on the surface, but not the a.2 domain or the TCR binding domain. 

Figure 17 demonstrates that the L d mutant 30.8 exhibits a properly folded TCR binding 
domain and properly folded a2/a3 domains. An L d /p2m error prone library was created using 
homologous recombination and electroporation into electrocompetent S. cerevisiae EBY100 
cells. The L d /p2m error prone yeast library was sorted with L d /pb2m a3 specific antibody, 
28.14.8, and an L d /P2m a2 specific antibody, 30-5-7. Sorted yeast cells were screened by 
flow cytometry for binding to 30-5-7 antibody and high affinity T cell receptor supernatant 
(m6), as described in Fig. 16 above. Unlike yeast cells that express the wild type L d /p2m, the 
L d P2m mutant, yeast cells expressing 30.8 were detected by the a2 specific antibody (30-5-7) 
and by the high affinity T cell receptor (m6) when the correct peptide (QL9) was added 
exogenously. Mutant 30.8 was also detected by the a3 specific antibody, 28.14.8. 

Figure 18 summarizes the results for a yeast display library of peptides that bind to 
MHC class I K\ Known K b peptides (SIYRYYGL, SEQ ID NO:l; EQYKFYSV, SEQ ID 
NO:5; SHNFEKL, SEQ ID NO:2) have anchor residues at peptide positions 5 and 8. P5 
requires an aromatic amino acid whereas P8 requires a hydrophobic amino acid. Yeast 
displaying a K b peptide binding motif, but with the AGA2 fused at the amino terminus, were 
isolated after incubation of yeast cells with the dEV8 (EQYKFYSV)/K b tetramer. 

Figure 19 shows that yeast cells displaying the K b peptide motif were detected by flow 
cytometry using a fluorescent-labeled K b tetramer. Yeast cells with the C-terminal sequence 

shown-in-the top-panel-w^^ 

dEV8 peptide dissociates from the K b pocket, and the AGA2-fused peptide on the yeast surface 

8 
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binds to the free K b . Binding of the peptide motif to the K b tetramer was detected by flow 
cytometry (shaded peak) . For a background control, yeast cells were treated with streptavidin- 
phycoerythrin (SA:PE) conjugate alone. Mutagenic substitution of the proline residue at 
position P4 with an alanine residue reduced binding significantly (bottom panel), indicating that 
the prolione is important in K b -binding. 
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Figure 20 shows that novel class I MHC binding peptides are isolated by yeast display 
technology. Using the lead peptides identified by yeast display experiments (Fig. 19), one can 
now identify improved peptides by production of libraries that contain mutations in non-anchor 
residues (e.g. P6, F7, and P9). 
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DETAILED DESCRIPTION OF THE INVENTION 
The role of proteins encoded by the major histocompatibility complex (called MHC 
proteins) has now been known for almost twenty years. MHC proteins are expressed by every 
individual and function as "antigen-presenting" molecules. That is, each MHC protein can 
bind to a variety of different small peptides (8 to 20 amino acids in length) that are derived 
from proteins present inside a cell. MHC proteins present both self-peptides (i.e., derived 
from an individual's own endogenous proteins) and foreign peptides (i.e., derived from a 
foreign agent such as a virus). Once a peptide is bound to the MHC protein, the entire peptide- 
MHC complex is expressed on the surface of the cell. If the peptide is foreign, a T lymphocyte 
(T cell) can potentially recognize the complex, and the resultant interaction of the T cell 
receptor (TCR) and the pMHC can result in T cell activation. T cell activation can lead to 
recruitment of other immune cells and a corresponding inflammatory reaction. Such 
inflammatory reactions are beneficial if the pMHC target antigen is, in fact, derived from an 
infectious agent or from a transformed cell (i.e., cancer). However, such inflammatory 
reactions can be very detrimental if the pMHC target antigen is derived from self tissue, as the 
reactions can lead to severe autoimmune disease, where an individual's immune system attacks 
normal tissue. Such is the case when a patient's lymphocytes attack the islet cells of the 
pancreas (type I diabetes), the nervous system (multiple sclerosis), or joint-derived components 
irheumatoid.arthritis) 
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The central role of pMHC complexes in these phenomena has been established by 
thousands of pubUshed studies that include genetic linkages of diseases to the human MHC 
locus (HLA). It has now also been established that it is possible to use appropriately 
characterized peptide-MHC molecules as either agonists of an immune response (e.g., in 
cancer and infectious diseases) or as antagonists of responses (e.g. , in autoimmune responses). 
While several approaches have been taken to produce such pMHC complexes in soluble forms 
for these purposes and for biochemical/structural studies, it has not been possible to use current 
methods of in vitro directed evolution to improve the stability or antigenicity of the pMHC 
complex. This is because the pMHC complex is normally a membrane-associated complex 
composed of multiple different subunits (heavy chain, beta-2-microglobulin, and peptide in the 
case of a class I MHC and a-chain, p-chain, and peptide in the case of class II MHC) and such 
proteins are typically not amenable to the current methods of directed evolution (primarily 
phage display). The present invention shows that a display system for directed evolution can 
be used to express properly folded Class I MHC proteins on the surface of yeast. The 
displayed peptide-MHC complexes can be used to directly activate T cells, in order to 
identify/screen for pMHC antigens . In addition, mutated libraries of the pMHC proteins could 
be created and used for selection by flow sorting of stabilized pMHC variants. The stabilized 
variants can be identified because they were expressed at higher levels on the yeast surface and 
can therefore be easily identified by using a fluorescent-labeled probe for the pMHC construct, 
combined with high-throughput flow cytometric sorting or such cells. 

The Class I MHC proteins (see Fig. 1 for ribbon diagram) are composed of an a chain 
of 350 amino acids and a P2m chain of 99 amino acids; the two chains are noncovalently 
associated with one another, Peptides that are about 8-10 residues bind the Class I MHC 
molecules. About 10 5 to 10 6 peptide-Class I MHC complexes are displayed on the surfaces of 
nucleated cells. Class I MHC complexes are involved in the recognition of virus infected cells, 
pathogen-infected cells, and tumor antigens by cytotoxic T lymphocytes. 

To date, no Class I MHC protein has been crystallized without a peptide bound in the 
peptide-bmdmg-cieft— Some-Q^^ 

soluble form because of their instabilities. For example, the mouse Class I MHC K b is 
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relatively stable, and it has been used in many studies. By contrast, the mouse Class I MHC 
L d is less stable, and it has been more difficult to produce. The stability of a particular 
peptide-Class I MHC complex is directly related to its ability to stimulate efficient T cell 
responses , for example, in vaccine applications . A system for the in vitro evolution of more 
5 stable peptide-MHC complexes allows for novel agents and vaccines to be used in stimulating 
protective immune responses against diseases and neoplastic conditions. 

The present invention allows the creation and isolation of stabilized variants of peptide- 
MHC complexes. Toward this end, we have displayed single-chain peptide/Class I MHC (a 

10 chain/(52m) complexes on the surface of yeast cells. Yeast cells expressing a specific 
peptide/MHC complex (SIYR/K b ) on the cell surface were capable of directly activating T 
cells, thus providing a method for screening powerful T cell agonists. Mutant libraries of a 
less stable peptide/MHC complex (dEV8/K b ) were made and expressed, and mutant MHC 
polypeptides of increased stability were isolated. Similarly, the less stable MHC L d was 

15 expressed as a mutant library displayed on the surface of recombinant yeast cells, and more 
stable L d variants were isolated using flow cytometry screening methodology. WO 99/36569, 
incorporated by reference herein, provides abundant discussion of this display technology. 

Fig. 2 illustrates the MHC protein of interest displayed on the yeast cell surface via a 
20 disulfide linkage through the AGA2 portion of the fusion protein comprising the MHC 
component. AGA2 is a mating adhesion receptor which is naturally bound to the cell surface 
in disulfide linkage to the AGA1 protein. The HA and the c-myc portions of the displayed 
fusion protein serve as epitope tags and can be used in normalizing the fluorescent peptide 
binding data. Each recombinant yeast cell displays about 50,000 copies of the surface-bound 
25 fusion protein (if stable) on its surface. A fluorescent antibody or peptide ligand is added, and 
the cells are sorted using flow cytometry. Those MHC fusion proteins of increased stability 
exhibit stronger binding of the fluorescent ligand, and these cells are selected during the cell 
sorting procedure. 

30 Eig,-3-iUustrates-diagrammatically-the-pCT3Q2-y 

contains a sequence encoding AGA2/HA-Class I MHC - c-myc fusion protein. This fusion 
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protein coding sequence is expressed in yeast under the regulatory control of the GAL1-10 
promoter. A similar vector, pYDl , is commercially available from Invitrogen (Carlsbad, CA). 

Fig. 4 shows various constructs for the expression of fusion proteins containing 
portions representing the peptide binding portions of various Class I MHC proteins. 
Surprisingly, the AGA2 portion was functional either at the N-terminus or the C-terminus of 
the fusion protein, and it mediated binding to the yeast cell wall surface when associated with 
any of the protein portions tested and in both the amino- or carboxyl terminal positions. In 
Fig. 4 SS refers to a signal peptide sequence necessary for proper intracellular transport of the 
fusion protein. The signal peptide is cleaved prior to display on the cell surface. The MHC 
polypeptide coding portion includes a sequence encoding a 15 amino acid spacer between the 
COOH terminus of the a chain and the NH 2 end of the P2m portion. HA refers to the peptide 
tag derived in sequence from hemagglutinin, which tag is located at the amino terminus of the 
a chain. C-myc refers to the peptide tag at the COOH terminus of the b2m portion. The three 
peptides linked at the amino terminus of K b represent a strong agonist ("SIYR" peptide, SEQ 
ID NO:l), a weak agonist (dEV8) and a null peptide (OVA) for the T cell clone called 2C. 
The AGA2 coding sequence was cloned at the COOH termini of these constructs in order to 
allow free NH 2 termini of the peptides, which is generally thought to be important for proper 
binding to Class I MHC polypeptides. All fusion protein coding sequences were assembled 
using standard polymerase chain reaction (PCR) strategies. 

The correct foldings of various fusion display proteins were confirmed by fluorescence 
activated cell sorting after binding of the recombinant yeast cells to fluorescently labeled 
antibody specific for c-myc or K b . See Fig. 5. 

To demonstrate the direct activation of T cells by recombinant yeast cells expressing 
a peptide/MHC complex, naive splenic T cells from a TCR transgenic mouse (2C TCR tg) 
were incubated with yeast cells expressing SIYR/K\ K b (negative control) or anti-TCR scFv 
(positive control). After about 20 hours in culture, the T cells were analyzed by flow 
- cytometry for ihe-u^ 
graphical display of the results. 
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Fig. 7 shows the dependence of T cell activation on T cells and on the presence of a 
peptide/MHC complex. Fig. 8 demonstrates the pMHC specificity: excess free peptide and 
anti-K b antibody inhibit CD69 up-regulation in T cells. 

We have demonstrated direct activation of T cells by yeast cells that express 
peptide/MHC. Splenocytes (10 6 ) from 2C TCR/RAG-l'- mice were combined with 
approximately 3-100 x 10 s of KF washed yeast cells displaying K b /p2m, SIYR/K b /p2m, or 
anti-TCR scFv (positive control) at 37 C, 5% C0 2 in a 24 well plate. After 20 hours in 
culture, cells were harvested, washed in PBS/0.5%BSA and stained with biotinylated 
clonotypic antibody, 1B2 [Kranz et al. (1984) supra] for 45 minutes. These yeast/T cell 
mixtures were washed and stained with a mixture of the early activation marker FITC-labeled 
anti-CD69 antibody and SA:PE. The yeast/T cell mixture was analyzed for the CD69 marker 
(present only on activated T cells) by flow cytometry and gating on 1B2 positive T cells. 

The yeast display system was exploited to produce a random mutagenized library from 
which stabilized mutant Class I MHC K b sequences were isolated. Constructs encoding the 
fusion proteins containing SIYR/K b and dEV8/K b portions were mutagenized randomly using 
error prone PCR (0.16 Mn:Mg molar ratio). The homologous recombination scheme 
illustrated in Fig. 9 was employed to create the libraries. From the SIYR/K b experiment, 3.6 
million transformants were recovered. From the dEV8/K b experiment, 2.7 million 
transformants were recovered. 5 plasmid inserts from each mutated library were sequenced 
(about 3200 bp of sequence per library) to determine the mutation frequency. In the SIYR/K b 
experiment the mutation frequency was 0.37% (2 wild type sequences, 3, 4, and 5 mutant 
sequences). In the dEV8/K b experiment, there was a 0.06% mutation frequency (4 wild type, 
1 mutant sequence). 

■ 

Fig. 10 diagrammatically presents the generic screening strategy for screening the 
mutated yeast display libraries. Fig. 11 illustrates the results for second and third sorts with 
anti-K b monoclonal antibody B.8.24.3. In the second sort the top 0.25% of the cells 
(accordmg-to fluoresces 
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The profiles were similar for the dEV8/K b 



Fig. 12 illustrates different K b binding as reflected in different levels of fluorescent 
peptide bound after sorting of the dEV8/K b mutant library produced using error prone PCR. 
Fig. 13 presents selected sequences of mutant clones isolated by sorting of the dEV8/K b mutant 
library, and the positions of the mutations are shown on the ribbon diagram of the Class I 
MHC protein. The following provides further details on the construction of a dEV8/K b mutant 
library. From the ribbon diagram it is deduced that the sequence EQYKFYSV (SEQ ID 
NO:5) is important. The E (PI, the first amino acid of the peptide sequence) is buried in the 
first pocket. Q, P2, is directed down into the pocket. Y, P3, is a bulky, secondary anchor 
residue. K, P3, is a primary TCR contact, and it is directed out of the pocket. The F residue 
(P4) is an aromatic residue directed into the pocket; it serves as a primary MHC anchor. The 
Y residue at P6 is a large aromatic residue which also functions in TCR contact. The S residue 
(P7) is a small residue; the small size is dictated by the space available. The last V residue 
(P8) serves as a primary anchor residue, and it is directed into the pocket. To select mutations 
that stabilize dEV8 binding to K b , mutations are introduced in the library at positions that point 
into K b . A degenerate library is produced at P1-P3 (i.e. , NNNKFYSV, SEQ ID NO:6). That 
library is constructed by error prone PCR and the mutant library is introduced into the wild 
type dEV8/K b plasmid by homologous transformation as shown in Fig. 9. Isolated 
recombinants are then sequenced. Of four sequenced, each contained different nucleotides 
encoding P1-P3. See Fig. 14 and its description herein above. 



The motivations for using directed evolution to isolate stabilized variants of L d include 
the lower stability of the wild type L d protein based on biochemical studies, the suboptimal 
loading of peptides in the endoplasmic reticulum, the slower intracellular transport of L d (4 hr 
v 1 hr), the lower cell surface expression (2-4 fold lower) and fewer a/p2m contacts. See 
Table 1. 



Fig.-15-provides-data-demonstrating-^ — 

ocl/«2 domains of L d are displayed on the surface of recombinant yeast cells. We have also 
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demonstrated that anti-L d al/2 domain (monoclonal antibody 30.5.7) did not bind to the same 
L d /p2m displayed on yeast cell surfaces. 



In summary, the Class I MHC K b was cloned as an AGA2 fusion in single chain format 
(AGA2-HA-K b -p2m-c-wyc). The full protein was detected on the yeast cell surface using an 
antibody that recognizes the properly folded cd/a2 domains of K b . The coding sequences of 
this protein were also cloned with AGA2 at the carboxy terminus, with K b binding peptides 
(SIYR, dEV8 and OVA) linked at the amino termini (peptide-HA-K b -p2m-c-myc-AGA2). 
Yeast that expressed the T cell specific peptide K b on the cell surface were capable of directly 
activating T cells, thereby providing a system for screening T cell agonists. 

i 

i 

To create stabilized K b mutant fusion protein, two different mutational libraries were 
produced by error prone PCR to yield random mutations after subsequent homologous 
recombination of mutagenized coding sequence and linearized vector after co-electroporation. 
Mutants that displayed higher yeast surface levels, and thus presumptive enhanced stability, 
were identified by flow cytometry sorting. 



The unstable class MHC protein L d was also cloned as an AGA2 fusion i n single chain 

format (AGA2-HA- L d »p2m-c-myc). L d was detected on the yeast cell surface with an <x3- 
specific antibody, but properly folded al/a2 domains were not detected with anti- L d antibody, 
suggesting that stabilized variants of L d can be engineered by directed evolution and flow 
sorting for improved yeast surface expression. 

The present inventors have now succeeded in isolating stabilized mutants of the class 
I molecule L d . The L d molecule has been shown to be relatively unstable. When expressed 
in the yeast display system, only the a3 domain appears to be folded properly on the surface 
(Fig. 16) . The displayed protein is not recognized by a conformation-specific oc2 antibody (30- 
5-7) nor is the QL9/L d complex recognized by the high-affinity T cell receptor called m6 
(Fig. 16). To isolate stabilized L d , a library of random mutants was expressed in yeast, and the 
Ubrarywas^ 

mutants were isolated. One mutant (30.8) is shown in Fig. 17. Mutant 30.8 bound to both the 
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a2-specific antibody and the QL9/L d (30.8) complex bound to the high affinity TCR m6, 
reflecting increased stability over its wild type parent protein. 

Another application of the yeast display to technology is to identify peptides that bind 
5 directly to MHC class I proteins. Serendipitously, we isolated a peptide that bound to the K b 
molecule even though the peptide was fused to the C-terminus of the AGA2 protein. This 
finding was completely unexpected. Previously, only peptides of 8 to 10 amino acids had been 
shown to bind to class I MHC products . This restriction in length was thought to be due to the 
need for both the N- and G-termini to bind within pockets of the class I molecule, thereby 

10 contributing binding energy to the interaction. AGA2-fusion peptides were isolated by 
screening with a dEV8/K b tetramer. A carboxy terminal peptide sequence HYSPFRQLA (SEQ 
ID NO: 37) that had K b -binding consensus anchor residues at positions P5 and P8 was isolated 
(Fig. 18). Yeast displaying the AGA2-fusion with the sequence HYSPFRQLA (SEQ ID 
NO: 37) at the C-terminus bound to dEV8/K b tetramers (Fig. 19). This binding was a 

15. consequence of the relative instability of dEV8 binding to K b . This instability apparently 
allows the peptide dEV8 to dissociate, and the surface-bound AGA2-fusion peptide binds to 
the free K\ Without wishing to be bound by any particular theory, we believe that the proline 
at position P4 allows the peptide to exit the K b -site at the residue amino terminal to the P5 
anchor. This is supported by the finding that an alanine substitution mutation at this position 

■ 

20 has significantly reduced binding (Fig. 19). These findings show that it is possible to create 
a library of fused peptides (to AGA2) and that this library can be screened for K b -binding (Fig. 
20). Thus, the yeast display system can be used to identify novel class I-binding peptides. 

In order to provide a clear and consistent understanding of the specification and claims , 
25 including the scope to be given to such terms, the following definitions are provided. 

A coding sequence is the part of a gene or cDNA which codes for the amino acid 
sequence of a protein, or for a functional RNA such as a tRNA or rRNA. 

30 Complement or complementaiy-sequence means a sequence of -nucleotidesJwMch forms- 

a hydrogen-bonded duplex with another sequence of nucleotides according to Watson-Crick 
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base-pairing rules. For example, the complementary base sequence for 5'-AAGGCT-3 ? is 3'- 
TTCCGA-5\ 

Downstream means on the 3' side of any site in DNA or RNA. 

Expression refers to the transcription of a gene into structural RNA (rRNA, tRNA) or 
messenger RNA (mRNA) and subsequent translation of a mRNA into a protein. 

An amino acid sequence that is functionally equivalent to a specifically exemplified 
MHC protein sequence is an amino acid sequence that has been modified by single or multiple 
amino acid substitutions, by addition and/or deletion of amino acids, or where one or more 
amino acids have been chemically modified, but which nevertheless retains the binding 
specificity and high affinity binding activity of a cell-bound or a soluble MHC protein of the 
present invention. Functionally equivalent nucleotide sequences are those that encode 
polypeptides having substantially the same biological activity as a specifically exemplified cell- 
bound or soluble MHC protein. In the context of the present invention, a soluble MHC protein 
is lacks the portions of a native cell-bound MHC and is stable in solution (i.e., it does not 
generally aggregate in solution when handled as described herein and under standard conditions 
for protein solutions). 

■ 

Two nucleic acid sequences are heterologous to one another if the sequences are derived 
from separate organisms, whether or not such organisms are of different species, as long as 
the sequences do not naturally occur together in the same arrangement in the same organism. 

Homology refers to the extent of identity between two nucleotide or amino acid 
sequences. 

Isolated means altered by the hand of man from the natural state. If an "isolated" 
composition or substance occurs in nature, it has been changed or removed from its original 
e nvironme nt, or both. For example, a polynucleotide or a polypeptide naturally present in a 
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living animal is not isolated, but the same polynucleotide or polypeptide separated from the 
coexisting materials of its natural state is isolated, as the term is employed herein. 

A linker region is an amino acid sequence that operably links two functional or 
5 structural domains of a protein. 

A nucleic acid construct is a nucleic acid molecule which is isolated from a naturally 
occurring gene or which has been modified to contain segments of nucleic acid which are 
combined and juxtaposed in a manner which would not otherwise exist in nature. 

10 

Nucleic acid molecule means a single- or double-stranded linear polynucleotide 
containing either deoxyribonucleotides or ribonucleotides that are linked by 3 f -5 T - 
phosphodiester bonds. 

1 5 Two DN A sequences are operably linked if the nature of the linkage does not interfere 

with the ability of the sequences to effect their normal functions relative to each other. For 
instance, a promoter region would be operably linked to a coding sequence if the promoter 

were capable of effect ing transcription of that coding sequence. 

■ 

20 A polypeptide is a linear polymer of amino acids that are linked by peptide bonds. 

Promoter means a cis-acting DNA sequence, generally 80-120 base pairs long and 
located upstream of the initiation site of a gene, to which RNA polymerase binds and initiates 
correct transcription. There can be associated additional transcription regulatory sequences 
25 which provide on/off regulation of transcription and/or which enhance (increase) expression 
of the downstream coding sequence. 

A recombinant nucleic acid molecule, for instance a recombinant DNA molecule, is a 
novel nucleic acid sequence formed in vitro through the ligation of two or more 
30 "nonhOToologous DNAl^ie^es~(fdf" sXmpl^recomb^nt^I^Md'conSffing one or more 
inserts of foreign DNA cloned into at least one cloning site), through PCR technology or by 
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directed homologous recombination, e.g. by co-transformation of two or more DNA molecules 
having at least regions of limited sequence identity to allow for homologous recombination 
with the transformed cell. . 

Transformation means the directed modification of the genome of a cell by the external 
application of purified recombinant DNA from another cell of different genotype, leading to 
its uptake and integration into the subject cell's genome. In bacteria, the recombinant DNA 
is not typically integrated into the bacterial chromosome, but instead replicates autonomously 
as a plasmid. 

Upstream means on the 5 ' side of any site in DNA or RNA. 

A vector is a nucleic acid molecule that is able to replicate autonomously in a host cell 
and can accept foreign DNA. A vector carries at least one origin of replication functional in 
at least one type of cell, one or more unique recognition sites for restriction endonucleases 
which can be used for the insertion of foreign DNA, and usually selectable markers such as 
genes coding for antibiotic resistance, and often recognition sequences (e.g. promoter) for the 
expression of the inserted DNA. Common vectors include plasmid vectors and phage vectors . 
There can be more than one origin of replication to allow for replication and maintenance in 
more than one type of cell (e.g., separate origins of replication functional in yeast and 
Escherichia coli, respectively). 

■ 

A virus infected cell is a cell in which a virus is replicating. Typically, a virus infected 
cell displays at least one antigen on its surface which is characteristic of the virus infection 
process. Such an antigen can be the target of recognition by the immune system and 
subsequent killing of the cell. 

A pathogen infected cell is a human or animal cell in which an intracellular parasite 
(bacterial, fungal or protozoan) is surviving or reproducing. Typically a pathogen infected cell 
displa ys at least one _antig^^nits_surface^Mch_is_characteristic_o 
antigen can be the target of immune recognition and targeting for killing of the infected cell. 
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The role of proteins encoded by the major histocompatibility complex (MHC proteins) 
have been known for nearly twenty years. MHC proteins are expressed by every individual, 
and they function as antigen-presenting molecules. Each MHC protein can bind to a variety 
of different small peptides (8 to 20 amino acids). 



Recently it was demonstrated that a scTCR (Vp-linker-Va) could be displayed on the 
surface of yeast [Kieke et al. (1999) Proc. Natl. Acad. Sci. USA 96:5651-5656], in the yeast 
display system that has proven successful in antibody engineering [Boder and Wittrup (1997) 
Nat. Biotech. 15: 553-557.; Kieke et al. (1999) supra]. In addition, it was shown that mutations 
that increased the surface levels of the TCR also increased the sability of the TCR in solution 
[Shusta et al. (1999) /. Mol. Biol. 292:949-956] . Thus, yeast surface display can now be used 
to isolate proteins that exhibit greater stability. 

It will be appreciated by those of skill in the art that, due to the degeneracy of the 
genetic code, numerous functionally equivalent nucleotide sequences encode the same amino 
acid sequence of an improved Class I MHC protein or class I MHC/peptide complex. 

Additionally, those of skill in the art, through standard mutagenesis techniq ue, i n 

conjunction with the antigen-finding activity assays described herein, can obtain altered MHC 
class I protein sequences and test them for the expression of polypeptides having particular 
binding activity. Useful mutagenesis techniques known in the art include, without limitation, 

i 

oligonucleotide-directed mutagenesis, region-specific mutagenesis, linker-scanning 
mutagenesis, and site-directed mutagenesis by PCR [see e.g. Sambrook et al. (1989) vide infra, 
and AuSubel et al. (1999) vide infra]. 

In obtaining variant MHC coding sequences, those of ordinary skill in the art will 
recognize that MHC-derived proteins can be modified by certain amino acid substitutions, 
additions, deletions, and post-translational modifications, without loss or reduction of 
biological activity. In particular, it is well-known that conservative amino acid substitutions, 
that-isrsubstitution of one amino acid for anote 

and conformation, are unlikely to significantly alter protein function. The 20 standard amino 
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acids that are the constituents of proteins can be broadly categorized into four groups of 
conservative amino acids as follows: the nonpolar (hydrophobic) group includes alanine, 
isoleucine, leucine, methionine, phenylalanine, proline, tryptophan and valine; the polar 
(uncharged, neutral) group includes asparagine, cysteine, glutamine, glycine, serine, threonine 
and tyrosine; the positively charged (basic) group contains arginine, histidine and lysine; and 
the negatively charged (acidic) group contains aspartic acid and glutamic acid. Substitution in 
a protein of one amino acid for another within the same group is unlikely to have an adverse 
effect on the biological activity of the protein. 

Homology between nucleotide sequences can be determined by DNA hybridization 
analysis, wherein the stability of the double-stranded DNA hybrid is dependent on the extent 
of base pairing that occurs. Conditions of high temperature and/or low salt content reduce the 
stability of the hybrid, and can be varied to prevent annealing of sequences having less than 
a selected degree of homology. For instance, for sequences with about 55% G - C content, 
hybridization and wash conditions of 40 - 50 C, 6 X SSC (sodium chloride/sodium citrate 
buffer) and 0.1% SDS (sodium dodecyl sulfate) indicate about 60 - 70% homology, 
hybridization and wash conditions of 50 - 65 EC, 1 X SSC and 0.1 % SDS indicate about 82 - 
97% homology, and hybridization and wash conditions of 52 C, 0.1 X SSC and 0.1% SDS 
indicate about 99 -100% homology. A wide range of computer programs for comparing 
nucleotide and amino acid sequences (and measuring the degree of homology) are also 
available, and a list providing sources of both commercially available and free software is 
found in Ausubel et al (1999). Readily available sequence comparison and multiple sequence 
alignment algorithms are, respectively, the Basic Local Alignment Search Tool (BLAST) 
(Altschul et al. y 1997) and ClustalW programs. BLAST is available on the Internet at 
http://www.ncbi.nlm.nih.gov and a version of ClustalW is available athttp://www2.ebi.ac.uk. 

Industrial strains of microorganisms (e.g., Aspergillus niger, Aspergillus ficuum, 
Aspergillus awamori, Aspergillus oryzae, Trichoderma reesei, Mucor miehei, Kluyveromyces 
lactis, Pichiapastoris, Saecharornyces cerevisiae, Escherichia coli, Bacillus subtilis or Bacillus 
lichenifomiisyovplmtspeci^ 

be used as host cells for the recombinant production of the mutant MHC proteins of the present 
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invention. As the first step in the heterologous expression of a high affinity MHC protein or 
soluble protein, an expression construct is assembled to include the MHC or soluble MHC 
coding sequence and control sequences such as promoters, enhancers and terminators. Other 
sequences such as signal sequences and selectable markers may also be included. To achieve 
5 extracellular expression of a soluble MHC polypeptide, the expression construct may include 
a secretory signal sequence. The signal sequence is not included on the expression construct 
if cytoplasmic expression is desired. The promoter and signal sequence are functional in the 
host cell and provide for expression and secretion of the MHC or soluble MHC protein. 
Transcriptional terminators are included to ensure efficient transcription. Ancillary sequences 
1 0 enhancing expression or protein purification may also be included in the expression construct. 

Various promoters (transcriptional initiation regulatory region) may be used according 
to the invention. The selection of the appropriate promoter is dependent upon the proposed 
expression host. Promoters from heterologous sources may be used as long as they are i 
1 5 functional in the chosen host. 

Promoter selection is also dependent upon the desired efficiency and level of peptide 
or protein production. Inducible promoters such tac are often employed in order to 
dramatically increase the level of protein expression E. coli. Overexpression of proteins may 
20 be harmful to the host cells. Consequently, host cell growth may be limited. The use of 
inducible promoter systems allows the host cells to be cultivated to acceptable densities prior 
to induction of gene expression, thereby facilitating higher product yields. 

Various signal sequences may be used according to the invention. A signal sequence 
25 which is homologous to the MHC coding sequence may be used. Alternatively, a signal 
sequence which has been selected or designed for efficient secretion and processing in the 
expression host may also be used. For example, suitable signal sequence/host cell pairs 
include the B. subtilis sacB signal sequence for secretion in 5. subtilis, and the Saccharomyces 
cerevisiae ot-mating factor or P. pastoris acid phosphatase phol signal sequences for P. 

-3G pastoris secretion^ The signal-sequence may be-joined directiy through-the s 

the signal peptidase cleavage site to the protein coding sequence, or through a short nucleotide 
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bridge consisting of usually fewer than ten codons, where the bridge ensures correct reading 
frame of the downstream TCR sequence. 

Elements for enhancing transcription and translation have been identified for eukary otic 
5 protein expression systems. For example, positioning the cauliflower mosaic virus (CaMV) 
promoter 1000 bp on either side of a heterologous promoter may elevate transcriptional levels 
by 10- to 400-fold in plant cells. The expression construct should also include the appropriate 
translational initiation sequences. Modification of the expression construct to include a Kozak 
consensus sequence for proper translational initiation may increase the level of translation by 
10 10 fold. 

A selective marker is often employed, which may be part of the expression construct 
or separate from it (e.g. , carried by the expression vector), so that the marker may integrate 
at a site different from the gene of interest. Examples include markers that confer resistance 

15 to antibiotics (e.g., bla confers resistance to ampicillin for E. coli host cells, nptll confers 
kanamycin resistance to a wide variety of prokaryotic and eukaryotic cells) or that permit the 
host to grow on minimal medium (e.g. , HIS4 enables P. pastoris or His" S. cerevisiae to grow 

in the absence of histidine) . The selectable marker has its own transcriptional and translational 

initiation and termination regulatory regions to allow for independent expression of the marker. 

20 If antibiotic resistance is employed as a marker, the concentration of the antibiotic for selection 
will vary depending upon the antibiotic, generally ranging from 10 to 600 \ig of the 
antibiotic/mL of medium. 

The expression construct is assembled by employing known recombinant DNA 
25 techniques (Sambrook et at, 1989; Ausubel et al, 1999). Restriction enzyme digestion and 
ligation are the basic steps employed to join two fragments of DNA. The ends of the DNA 
fragment may require modification prior to ligation, and this may be accomplished by filling 
in overhangs, deleting terminal portions of the fragment(s) with nucleases (e.g. , ExoTH), site 
directed mutagenesis, or by adding new base pairs by PCR. Polylinkers and adaptors may be 
30 ^mployedl:^f^Uitatejoining of ^electoi ffagments^ The ^p^siOTTcoMt^n^t^ically 
assembled in stages employing rounds of restriction, ligation, and transformation of E. coli. 
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Homlogons recombination strategies can be used with, co-electroporation of linear DN As into 
yeast (see Fig. 9). Numerous cloning vectors suitable for construction of the expression 
construct are known in the art (XZAP and pBLUESCRIPT SK-1, Stratagene, LaJolla, CA; 
pET, Novagen Inc., Madison, WI; cited in Ausubel et al, 1999) and the particular choice is 
not critical to the invention. The selection of cloning vector will be influenced by the gene 
transfer system selected for introduction of the expression construct into the host cell. At the 
end of each stage, the resulting construct may be analyzed by restriction, DNA sequence, 
hybridization and PCR analyses. 

The expression construct may be transformed into the host as the cloning vector 
construct, either linear or circular, or may be removed from the cloning vector and used as is 
or introduced onto a delivery vector. The delivery vector facilitates the introduction and 
maintenance of the expression construct in the selected host cell type. The expression 
construct is introduced into the host cells by any of a number of known gene transfer systems 
(e.g., natural competence, chemically mediated transformation, protoplast transformation, 
electroporation, biolistic transformation, transfection, or conjugation) (Ausubel et al , 1999; 
Sambrook et al, 1989). The gene transfer system selected depends upon the host cells and 
vector systems used. 

For instance, the expression construct can be introduced into S. cerevisiae cells by 
protoplast transformation or electroporation. Electroporation of S. cerevisiae is readily 
accomplished, and yields transformation efficiencies comparable to spheroplast transformation. 

■ 

Monoclonal or polyclonal antibodies, preferably monoclonal, specifically reacting with 
a protein of interest may be made by methods known in the art. See, e.g. , Harlow and Lane 
(1988) Antibodies: A Laboratory Manual, Cold Spring Harbor Laboratories; Goding (1986) 
Monoclonal Antibodies: Principles and Practice, 2d ed., Academic Press, New York; and 
Ausubel et al. (1999) Current Protocols in Molecular Biology, John Wiley & Sons, Inc. , New 
York. 
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MHC/peptide complexes of improved stability in cell-bound or soluble form which are 
characteristic of a particular neoplastic condition (cancer, tumor, or the like) or a particular 
virus infected cell or pathogen infected cell are useful, for example, as agonists of the immune 
system so that the neoplastic cell, virus infected cell or pathogen infected cell is more 
efficiently targeted for removal by T cells of the immune system. The improved MHC/p 
complexes can be labeled by joining, either covalently or noncovalently, a substance which 
provides a detectable signal. Suitable labels include but are not limited to radionuclides, 
enzymes, substrates, cofactors, inhibitors, fluorescent agents, chemiluminescent agents, 
magnetic particles and the like. Additionally the MHC protein of the present invention can be 
coupled to a ligand for a second binding molecules: for example, the MHC protein can be 
biotinylated. Detection of the MHC protein or complex can be effected by binding of a 
detectable streptavidin (a streptavidin to which a fluorescent, radioactive, chemiluminescent, 
or other detectable molecule is attached or to which an enzyme for which there is a 
chromophoric substrate available). United States patents describing the use of such labels 
and/or toxic compounds to be covalently bound to the scTCR protein include but are not 
limited to Nos. 3,817,837; 3,850,752; 3,927,193; 3,939,350; 3,996,345; 4,277,437; 
4,275,149; 4,331,647; 4,348,376; 4,361,544; 4,468,457; 4,444,744; 4,640,561; 4,366,241; 
RE 35,500; 5,299,253; 5,101,827; 5,059,413. Labeled MHC proteins or complexes can be 
detected using a monitoring device or method appropriate to the label used. Fluorescence 
microscopy or fluorescence activated cell sorting can be used where the label is a fluorescent 
moiety, and where the label is a radionuclide, gamma counting, autoradiography or liquid 
scintillation counting, for example, can be used with the proviso that the method is appropriate 
to the sample being analyzed and the radionuclide used. In addition, there can be secondary 
detection molecules or particle employed where there is a detectable molecule or particle which 
recognized the portion of the MHC protein which is not part of the binding site for the cognate 
TCR or other ligand or other ligand in the absence of a component as noted herein. The art 
knows useful compounds for diagnostic imaging in situ; see, e.g., U.S. PatentNo. 5,101,827; 
5,059,413. Radionuclides useful for therapy and/or imaging in vivo include ln Indium, 
97 Rubidium, 125 Iodine, 131 Iodine, 123 Iodine, ^Gallium, "Technetium. Toxins include diphtheria 
toxinrricinand-castorbean-tox^ 

complex is bound to the cell, the toxic moiety is internalized so that it can exert its cytotoxic 
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effect. Immunotoxin technology is well known to the art, and suitable toxic molecules include, 
without limitation, chemotherapeutic drugs such as vindesine, antifolates, e.g. methotrexate, 
cisplatin, mitomycin, anthrocyclines such as daunomycin, daunorubicin or adriamycin, and 
cytotoxic proteins such as ribosome inactivating proteins (e.g., diphtheria toxin, pokeweed 
antiviral protein, abrin, ricin, pseudomonas exotoxin A or their recombinant derivatives. See, 
generally, e.g., Olsnes and Pihl (1982) Pharmac. Ther. 25:355-381 and Monoclonal 
Antibodies for Cancer Detection and Therapy, Eds. Baldwin and Byers, pp. 159-179, 
Academic Press, 1985. 

Table, high affinity MHC proteins specific for a particular ligand, e.g., a particular 
peptide, protein or cell type, are useful in diagnosing animals, including humans, believed to 
be suffering from a disease associated with the particular pMHC. The MHC molecules of the 
present invention are useful for detecting T cells that are specific for essentially any antigen 
including, but not limited to, those associated with a neoplastic condition, an abnormal protein, 
or an infection or infestation with a bacterium, a fungus, a virus, a protozoan, a yeast, a 
nematode or other parasite. Stable, high affinity MHC proteins specific for a particular ligand 
can also be used to induce the activity of T cells against antigens if desirable. For example, 
there are many peptides that have been associated with neoplastic cells, abnormal proteins, 
bacteria, fungi, a viruses, and protozoans whereby said peptides bind to a Class I MHC 
protein. Stable, high affinity MHC proteins in complex with these peptides could serve as 
vaccines against the diseases by inducing T cell activity. 

The high affinity MHC compositions can be formulated by any of the means known in 
the art. They can be typically prepared as injectables, especially for intravenous, 
intraperitoneal or synovial administration (with the route determined by the particular disease) 
or as formulations for intranasal or oral administration, either as liquid solutions or 
suspensions. Solid forms suitable for solution in, or suspension in, liquid prior to injection or 
other administration may also be prepared. The preparation may also, for example, be 
emulsified, or the protein(s)/peptide(s) encapsulated in liposomes. 



26 



WO 02/46399 



PCT/US01/47817 



The active ingredients are often mixed with excipients or carriers which are 
pharmaceutical^ acceptable and compatible with the active ingredient. Suitable excipients 
include but are not limited to water, saline, dextrose, glycerol, ethanol, or the like and 
combinations thereof. The concentration of the MHC protein in injectable, aerosol or nasal 
formulations is usually in the range of 0.05 to 5 mg/ml. Similar dosages can be administered 
to other mucosal surfaces. 



In addition, if desired, vaccines may contain minor amounts of auxiliary substances 
such as wetting or emulsifying agents, pH buffering agents, and/or adjuvants which enhance 
the effectiveness of the vaccine. Examples of adjuvants which may be effective include but are 
not limited to: aluminum hydroxide; N-acetyl-muramyl-Lrthreonyl-D-isoglutamine (thr-MDP); 
N-acetyl-nor-muramyl-^ (CGP 11637, referred to as nor-MDP); 

acetylmuramyl-L-alanyl-D-isoglm^ 

3hydroxyphosphoryloxy)-ethylamine (CGP 19835A, referred to as MTP-PE); and RIBI, which 
contains three components extracted from bacteria: monophosphoryl lipid A, trehalose 
dimycolate and cell wall skeleton (MPL-f TDM+CWS) in a 2% squalene/Tween 80 emulsion. 
Such additional formulations and modes of administration as are known in the art may also be 
used. 



The stable high affinity MHC proteins of the present invention and/or pMHC-binding 
fragments having primary structure similar (more than 90% identity) to the high affinity MHC 
proteins and which maintain the high affinity for the cognate ligand may be formulated into 
vaccines as neutral or salt forms. Pharmaceutical^ acceptable salts include but are not limited 
to the acid addition salts (formed with free amino groups of the peptide) which are formed with 
inorganic acids, e.g., hydrochloric acid or phosphoric acids; and organic acids, e.g., acetic, 
oxalic, tartaric, or maleic acid. Salts formed with the free carboxyl groups may also be 
derived from inorganic bases, e.g., sodium, potassium, ammonium, calcium, or ferric 
hydroxides, and organic bases, e.g., isopropylamine, trimethylamine, 2-ethylamino-ethanol, 
histidine, and procaine. Alternatively, these stable high affinity MHC proteins can be used as 
antagonistsofminteracti^^ 
cognate TCR cells. 
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High affinity MHC proteins or complexes for therapeutic use, e.g., those conjugated 
to cytotoxic compounds are administered in a manner compatible with the dosage formulation, 
and in such amount and manner as are prophylactically and/or therapeutically effective, 
according to what is known to the art. The quantity to be administered, which is generally in 
the range of about 100 to 20,000 pg of protein per dose, more generally in the range of about 
1000 to 10,000 iig of protein per dose. Similar compositions can be administered in similar 
ways using labeled high affinity MHC proteins for use in imaging, for example, to detect 
deleterious cytotoxic T cells that are involved in autoimmune attacks and containing the 
cognate pMHCs. Precise amounts of the active ingredient required to be administered may 
depend on the judgment of the physician or veterinarian and may be peculiar to each 
individual, but such a determination is well within the skill of such a practitioner. 



The vaccine or other immunogenic composition may be given in a single dose; two dose 
schedule, for example two to eight weeks apart; or a multiple dose schedule. A multiple dose 
schedule is one in which a primary course of vaccination may include 1 to 10 or more separate 
doses, followed by other doses administered at subsequent time intervals as required to 
maintain and/or reinforce the immune response, e.g. , at 1 to 4 months for a second dose, and 
if needed, a subsequent dose(s) after several months. Humans (or other animals) immunize 
with the retrovirus-like particles of the present invention are protected from infection by the 
cognate retrovirus. 

Standard techniques for cloning, DNA isolation, amplification and purification, for 
enzymatic reactions involving DNA ligase, DNA polymerase, restriction endonucleases and - 
the like, and various separation techniques are those known and commonly employed by those 
skilled in the art. A number of standard techniques are described in Sambrook et al. (1989) 
Molecular Cloning, Second Edition, Cold Spring Harbor Laboratory, Plainview, New York; 
Maniatis et al. (1982) Molecular Cloning, Cold Spring Harbor Laboratory, Plainview, New 
York; Wu (ed.) (1993) Meth. Enzymol. 218, Parti; Wu (ed.) (1979) Meth Enzymol. 68; Wu 
et al. (eds.) (1983) Meth. Enzymol. 100 and 101; Grossman and Moldave (eds.) Meth. 
£r^7?z0^65;-Mm^^ 

Laboratory, Cold Spring Harbor, New York; Old and Primrose (1981) Principles of Gene 
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Manipulation, University of California Press, Berkeley; Schleif and Wensink (1982) Practical 
Methods in Molecular Biology; Glover (ed.) (1985) DNA Cloning Vol. I and n, IRL Press, 
Oxford, UK; Hames and Higgins (eds.) (1985) Nucleic Acid Hybridization, IRL Press, Oxford, 
UK; and Setlow and Hollaender (1979) Genetic Engineering: Principles and Methods, Vols. 
1-4, Plenum Press, New York. Abbreviations and nomenclature, where employed, are deemed 
standard in me field and commonly used in professional journals such as those cited herein. 

All references cited in the present application are incorporated by reference herein to 
supplement the disclosure and experimental procedures provided in the present Specification 
to the extent that there is no inconsistency with the present disclosure. 

The following examples are provided for illustrative purposes, and are not intended to 
limit the scope of the invention as claimed herein. Any variations in the exemplified articles 
and/or methods which occur to the skilled artisan are intended to fall within the scope of the 
present invention. 

EXAMPLES 



Example 1. Fusion of the MHC a chain to the p2m in the Yeast Display Vector pCT302 

The mouse MHC K b a chain was fused to the mouse P2m [Mottez et al. (1995) /. Exp. 
Med. 181(2), 493-502]. The K b gene was PCR amplified using primers with Nhe I and Aft H 
restriction sites. The 5' primer contained a 30 bp linker upstream of the K b gene and the 3' 
primer contained a 45 bp connecting linker (underlined) downstream of the gene (5' CAA TGG 
CTA GCG GTG GAC TTA AGG GTG GAC CAG GTG GAG GIT CAG GAG GTG GAG 
GCC CAC ACT CGC TGA GGT ATT TCG T 3', SEQ ID NO:7; and 5' TGA ACC TCC 
GCC TCC TGA TCC ACC GC C ACC TGA ACQ TAT TCC ACC. CTC CCA TCT CAG 
GGT GAG GGG CTC AGG 3 ', SEQ ID NO. 8). The P2m gene was PCR amplified with the 
5' primer containing the overlapping 45 bp linker (underlined) upstream and the 3* primer 
containing a c-tnyc epitope tag and unique Xho I site downstream of the P2m (5 ' GGTGGA 
ATA-GGT-TGA-GGT-Gn P-GGT-GGA-TeA GGA-GrTC GGACtCT T^A- ATf^ r An-A A A- 
ACC CCT CAA ATT CAA GTA T 3', SEQ ID NO:9, and 5 'GTT CCC TCG AGC TAT 
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TAC AAG TCT TCT TCA GAA ATA AGC TTT TGT TCC ATG TCT CGA TCC CAG 
TAG ACG GT 3\ SEQ ID NO: 10). Using PCR "sewing" of the overlapping linker [Davis 
et al. (1992) Biotechnology 9(2), 165-169], an amplified K b /p2m fusion was generated using both 
the K b and P2m PCR products, primers 1 and 4 and TaqPlus Precision PCR System 
5 (Stratagene, La Jolla, CA). The L d /p2m MHC gene was fused and PCR amplified in an 
analogous manner. The K b /p2m and L d /p2m PCR products were digested with Nhe VXho I and 
ligated into the yeast surface display vector pCT302 containing a nine-residue epitope tag (HA) 
and the AGA2 open reading frame downstream of the inducible GAL1 promoter [Boder and 
Wittrup (1997) supra]. 

10 

Example 2. Peptide/MHC Construction. 

The AGA2 gene was cloned at the COOH terminus of the peptide/K b /p2m gene in order 
to allow a free NH2 terminus of the peptide. First, the K b /p2m signal sequence was PCR 
amplified with EcoBI and Nhe I restriction sites upstream and downstream of the signal 
1 5 sequence (SS) (5 ' TCT CAA GAA TTC TAC TTC ATA CAT TTT 3 ', SEQ ID NO: 1 1 ; and 
5' GTA TCT GCT AGC TGC TAA AAC TGA AGC 3', SEQ ID NO: 12). The SS PCR 
product was digested Nhe VEco RI and ligated into the Nhe VEco RI digested pCT302 vector. 
The resulting plasmid contained the SS but not the AGA2 gene. Secondly, a SIYRYYGL 
(SEQ ID NO:l) (SIYR)/K b /p2m PCR product was generated with a SIYR encoded primer 
20 (underlined) (5' ATA CTA GCT AGC TTG GAT AAA AGG TCT ATT TAT AGA TAT 
TAT GGT TTG CTT AAG GGT GGA CCA GGT GGA GGT 3' , SEQ ID NO: 13; and 5 * 
CAA TCC AGA TCT TTA CTA ATG CAA GTC TTC TTC AGA AAT AAG 3', SEQ ID 
NO: 14) . Nhe I and Nde I restriction sites are located in the upstream and downstream primers 
respectively. The SIYR/K b /p2m PCR product was digested with Nhe I and Nde I and ligated 
25 into the Nhe II Nde I digested pCT302 SS vector (now called SS-SIYR K b pCT302). Finally, 
AGA2 was PCR amplified (5' GGA TAT CAT ATG CAG GAA CTG AC A ACT AT A3', 
SEQ ID NO: 15, and 5 ' ATT TGC AGA TCT CGA GTT ACT AAG CGT AGT CTG GAA 
CGT CGT A 3', SEQ ID NO: 16), digested with Nde I and Xho I and ligated into the Nde 
VXhol digested SS-SIYR K b pCT302. The resulting construct contained the following order 
30 of-genes-in-me-pCT302-backbone:-SSrSIYRK b /p2mrAGA2 
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Example 3. OVA and dEV8 Peptide Loading. 

The sense and anti-sense oligonucleotide sequences for both OVA (SHNFEKL, SEQ 
ID NO:2) and dEV8 (EQYKFYSV, SEQ ID NO:5) with a Nhe I and Afl H site upstream and 
downstream respectively were each phosphorylated with T4 DNA kinase (OVA 5' CT AGC 
TTG GAT AAA AGG AGC ATC ATC AAT TTT GAA AAG CTT C3 \ SEQ ID NO: 17; and 
5'TT AAG AAG CTT TTC AAA ATT GAT GAT GCT CCT TTT ATC CAA G3\ SEQ ID 
NO:18; dEV8 5' CT AGC TTG GAT AAA AGG GAA CAA TAC AAA TTC TAC TCA 
GTT C3 \ SEQ ID NO: 19; and 5 "IT AAG AAC TGA GTA GAA TTT GTA TTG TTC CCT 
TTT ATC CAA G3', SEQ ID NO:20). The sense and anti-sense phosphorylated 
oligonucleotides were mixed (400 pmol), heated at 100°C for 1 min and cooled slowly. 
Phosphorylated peptide cassettes were ligated into Nhe II Afl n digested SIYR/K b /p2m pCT302. 
Ligation reactions were transformed into DH10B electrocompetent £ coli cells and plated on 
LB/amp and incubated for 15hrsat37 C. Transformants were screened, and positive clones 
were sequenced. The resulting plasmids contained OVA and dEV8 tethered to K b /p2m. 

Example 4. Transformation into Yeast. 

The resultant MHC nucleotide constructs were transformed by the lithium acetate 
(LiAc) transformation method [Geitz et al. (1995) Yeast 1 1 , 355-360] into the S. cerevisiae strain 
BJ5465 (a ura3-52 trpl leulAl his3A200 pep4::HIS2 prbAl 6 canl GAL; Yeast Genetic Stock 
Center, Berkeley, CA) containing a chromosomally integrated AGA1 coding sequence 
expressed under the control of the GAL1 promoter (strain EBY100; Boder and Wittrup (1997) 
supra). 



Example 5. Induction and Detection of MHC on the Yeast Surface. 

Yeast cells transformed with pCT302/MHC plasmid constructs were grown overnight 
at 30 C with shaking in 2 mL selective glucose medium SD-CAA (glucose 2 wt % , Difco yeast 
nitrogenbase0.67wt %,casaminoacids0.5 wt %). After 18-24 hours, recombinant AGA1 
+ AGA2-MHC I expression was induced at 20°C with shaking in 5 mL selective galactose 
medium (SG-CAA, where 2% galactose replaces the glucose in SD-CAA). Cultures were 
har vested-after ^^8-hours-(-l-2-douM 

NaP0 4 , 150 mM NaCl, pH 7.3) containing 0.5% bovine serum albumin and incubated 45 
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minutes on ice with 25 M L of an anti-MHC antibody, anti-c-royc Mab 9E10 (1 : 100 dilution of 
raw ascites fluid; Berkeley Antibody Co., Richmond, CA), or anti-HA Mab 12CA5 (10 jig/ml. 
Boehringer Mannheim, Indianapolis, IN). Cells were washed with PBS and incubated 30 
minutes on ice with either FITC-Iabeled F(ab') 2 goat anti-mouse IgG (1:50; Kirkegaard and 
Perry Labs, Inc., Gaithersburg, MD) or a streptavidm-phycoeiythrin (SA-PE) conjugate 
(1:200; PharMingen, San Diego, CA). Labeled yeast cells were analyzed on a Coulter Epics 
XL flow cytometer. Data for 10,000-20,000 events were collected, and the population was 
gated according to light scatter (size) to prevent analysis of cell clumps. 



Example 6. Random Mutagenesis of dEV8/K b /p2m and SIYR/ K b /p2m. 

dEV8/K b /p2m and SIYR/K b /p2m genes were randomly mutagenized using a PCR error 
prone technique. dEV8/K b /p2m and SIYR/K b /p2m were PCR amplified using vector specific 
primers at least 50 bp upstream and downstream of each gene with PCR conditions that cause 
random mutations to be inserted by Taq polymerase (GD3CO/BRL, Invitrogen,Carelsbad, 
CA). At a Mn:Mg ratio of about 0.16:1, the polymerase is more susceptible to inserting a 
random nucleotide during elongation. Using homologous recombination [Raymond et al. 
(1999) Biotechniques 26(1): 134-138, 140-141], the dEV8/K b /p2m and SIYRyK b /p2m pCT302 
vectors digested with Nhe and Ndel were combined with the error prone PCR products and 
electroporated (Bio-Rad Gene-Pulser n, 1.5V, 25/tF, 0.2cm gene pulser cuvettes) into 40 pi 
of electrocompetent S. cerevisiae cells (BJ5465, strain EBY100). The resulting 
transformations (separately for the two chimeric genes) are pooled, and a dilution is plated on 
SD-CAA plates. Plates are incubated at 30 C for 3 days, and the library size is tabulated. 



Example 7. Cell Sorting. 

The yeast library was grown in SD-CAA (2% dextrose, 0.67 % yeast nitrogen base, 1 % 
Casamino acids (Difco, Detroit, MI)) at 30 C to an 00^0=4.0. To induce surface scTCR 
expression, yeast cells were pelleted by centrifugation, resuspended to an 00500=1.0 in SG- 
CAA (2% galactose, 0.67% yeast nitrogen base, 1 % casamino acids), and incubated at 20 C 
for 48 hr. In general, 10 7 cells/tube were incubated on ice for 1 hr with 50 fd of 
biotmylated-anti-K— antibody-(B78724T3)-dUuted-m-phospliate-buffer^ 
supplemented with 0.5 mg/ml BSA (PBS-BSA). After incubation, cells were washed and 
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labeled for 30 min with SA.PE in PBS-BSA. Yeast cells were then washed and resuspended 
in PBS-BSA immediately prior to sorting. Cells exhibiting the highest fluorescence were 
isolated by FACS sorting with a Coulter 753 bench. After isolation, sorted cells were 
expanded in SD-CAA and induced in SG-CAA for subsequent rounds of selection. Three 
sequential sorts were performed for each mutant preparation with increasingly dilute anti-K b . 
The percentages of total cells isolated from each sort were 1.0, 0.25 and 0.1%, respectively. 
Aliquots of the third sorts were plated on SD-CAA to isolate individual clones, which were 
then analyzed by flow cytometry using a Coulter Epics XL instrument. The clones with the 
highest fluorescence have their DNA rescued with the Zymoprep Yeast Plasmid Miniprep Kit 
(Zymogen Research, Orange, CA). The DNA is retransformed into DH10 Belectrocompetent 
E. coli, mini prepped and submitted for sequencing. The sequences are analyzed, and the 
mutations are located. 



Table 1. 

Comparison of Properties of H-2K b , H-2D b and H-2 L d Proteins 
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Table 2. 

Various Single-Chain Class I MHC Constructions 
Cloned in pCT302 Yeast Display Vector 

Nucleic Acid Sequence 1 : SS-AGA2-K b /p2m-c-/wyc (SEQ ID NO:22) 

ATGCA6TTACTTCGCTGTTTTTCAATATTTTCT6TTATTGCTTCAGTTTTAGC 
ACAGGAACTGACAACTATATGCGAGCAAATCCCCTCACCAACTTTAGAATCGA 
CGCCGTACTCTTTGTGAACGACTACTATTTTGGCCAACGGGAAGGCAATGCAA 
GGAGTTTTTGAATATTACAAATCAGTAACGTTTGTCAGTAATTGCGGTTCTCA 
CCCCTCAACAACTAGCAAAGGCAGCCCGATAAACACACAGTATGTTTTTAAGG 
ACAATAGCTCGACGATTGAAGGTAGATACCCATACGACGTTCCAGACTACGCT 
CTGCAGGCTAGTGGTGGTGGTGGTTCTGGTGGTGGTGGTTCTGGTGGTGGTGG 
TTCTGCTAGCGGTGGACTTAAGGGTGGACCAGGTGGAGGTTCAGGAGGTGGAG 
GCCCACACTCGCTGAGGTATTTCGTCACCGCCGTGTCCCGGCCCGGCCTCGGG 
GAGCCCCGGTACATGGAAGTCGGCTACGTGGACGACACGGAGTTCGTGCGCTT 
CGACAGCGACGCGGAGAATCCGAGATATGAGCCGCGGGCGCGGTGGATGGAGC 
AGGAGGGGCCCGAGTATTGGGAGCGGGAGACACAGAAAGCCAAGGGCAATGAG 
CAGAGTTTCCGAGTGGACCTGAGGACCCTGCTCGGCTACTACAACCAGAGCAA 
GGGCGGCTCTCACACTATTCAGGTGATCTCTGGCTGTGAAGTGGGGTCCGACG 
GGCGACTCCTCCGCGGGTACCAGCAGTACGCCTACGACGGCTGCGATTACATC 
GCCCTGAACGAAGACCTGAAAACGTGGACGGCGGCGGACATGGCGGCGCTGAT 
CACCAAACACAAGTGGGAGCAGGCTGGTGAAGCAGAGAGACTCAGGGCCTACC 
TGGAGGGCACGTGCGTGGAGTGGCTCCGCAGATACCTGAAGAACGGGAACGCG 
ACGCTGCTGCGCACAGATTCCCCAAAGGCCCATGTGACCCATCACAGCAGACC 
TGAAGATAAAGTCACCCTGAGGTGCTGGGCCCTGGGCTTCTACCCTGCTGACA 
TCACCCTGACCTGGCAGTTGAATGGGGAGGAGCTGATCCAGGACATGGAGCTT 
GTGGAGACCAGGCCTGCAGGGGATGGAACCTTCCAGAAGTGGGCATCTGTGGT 
GGTGCCTCTTGGGAAGGAGCAGTATTACACATGCCATGTGTACCATCAGGGGC 
TGCCTGAGCCCCTCACCCTGAGATGGGAGGGTGGAATAGGTTCAGGTGGCGGT 
GGATCAGGAGGCGGAGGTTCAATCCAGAAAACCCCTCAAATTCAAGTATACTC 
ACGCCACCCACCGGAGAATGGGAAGCCGAACATACTGAACTGCTACGTAACAC 
AGTTCCACCCGCCTCACATTGAAATCCAAATGCTGAAGAACGGGAAAAAAATT 
CCTAAAGTAGAGATGTCAGATATGTCCTTCAGCAAGGACTGGTCTTTCTATAT 
CCTGGCTCACACTGAATTCACCCCCACTGAGACTGATACATACGCCTGCAGAG 

TTAAGCATGACAGTATGGCCGAGCCCAAGACCGTCTACTGGGATCGAGACATG 
GAACAAAAGCTTATTTCTGAAGAAGACTTGTAATAGCTCGAG 



34 



WO 02/46399 

PCT/US01/47817 

Amino Acid Sequence 1: SS-AGA2-K b /p2m-c-wyc (SEQ ID NO:23) 

MQLLRCPSIPSVIASVLAQELTTICEQIPSPTLESTPYSLSTTTILANGKAMO 
6VPEYYKSVTFVSNCGSHPSTTSKGSPINTQYVFKDNSSTIEGRYPYDVPDYA 
LQASGGGGSGGGGSGGGGSASGGLKGGPGGGSGGGGPHSLRYFVTAVSRPGLG 
EPRYMEVGYVDDTEFVRFDSDAENPRYEPRARWMEQEGPEYWERETQKAKGNE 
QSFRVDLRTLLGYYNQSKGGSHTIQVISGCEVGSDGRLLRGYQQYAYDGCDYI 
ALNEDLKTWTAADMAAL ITKHKWEQAGEAERLRAYLEGTCfVEWLRRYLKNGNA 

TLLRTDSPKAHVTHHSRPEDKVTLRCWALGPYPADITLTWQLNGEELIQDMEL 

VETRPAGDGTFQKWASVWPLGKEQYYTCHVYHQGLPEPLTLRWEGGIGSGGG 

GSGGGGSIQKTPQIQVYSRHPPENGKPNILNCYVTQFHPPHIEIQMLKNGKKI 

PKVEMSDMSFSKDWSFYIIAHTEFTPTETDTYACRVKHDSMAEPKTVYWDRDM 
EQKLISEEDL**LE 



Nucleic Acid Sequence 2: SS-SIYR/K b /p2m-c-wyc-AGA2 (SEQ ID NO:24) 

GAATTCTACTTCATACATTTTCAATTAAGATGCAGTTACTTCGCTGTTTTTCA 

ATATTTTCTGTTATTGCTTCAGTTTTAGCAGCTAGCTTGGATAAAAGATCTAT 

TTATAGATATTATGGTTTGCTTAAGGGTGGACCAGGTGGAGGTTCAGGAGGTG 

GAGGCCCACACTCGCTGAGGTATTTCGTCACCGCCGTGTCCCGGCCCGGCCTC 

GGGGAGCCCCGGTACATGGAAGTCGGCTACGTGGACGACACGGAGTTCGTGCG 

CTTCGACAGCGACGCGGAGAATCCGAGATATGAGCCGCGGGCGCGGTGGATGG 

AGGRGGAGGGQCCCGAGTATTGGGAGCGGGAGACAGAGAAAGCCAAGGGGAAT 

GAGCAGAGTTTCCGAGTGGACCTGAGGACCCTGCTCGGCTACTACAACCAGAG 

CAAGGGCGGCTCTCACACTATTCAGGTGATCTCTGGCTGTGAAGTGGGGTCCG 

ACGGGCGACTCCTCCGCGGGTACCAGCAGTACGCCTACGACGGCTGCGATTAC 

ATCGCCCTGAACGAAGACCTGAAAACGTGGACGGCGGCGGACATGGCGGCGCT 

GATCACCAAACACAAGTGGGAGCAGGCTGGTGAAGCAGAGAGACTCAGGGCCT 

ACCTGGAGGGCACGTGCGTGGAGTGGCTCCGCAGATACCTGAAGAACGGGAAC 

GCGACGCTGCTGCGCACAGATTCCCCAAAGGCCCATGTGACCCATCACAGCAG 

ACCTGAAGATAAAGTCACCCTGAGGTGCTGGGCCCTGGGCTTCTACCCTGCTG 

ACATCACCCTGACCTGGCAGTTGAATGGGGAGGAGCTGATCCAGGACATGGAG 
CTTGTGGAGACCAGGCCTGCAGGGGATGGAACCTTCCAGAAGTGGGCATCTGT 
GGTGGTGCCTCTTGGGAAGGAGCAGTATTACACATGCCATGTGTACCATCAGG 
GGCTGCCTGAGCCCCTCACCCTGAGATGGGAGGGTGGAATAGGTTCAGGTGGC 
GGTGGATCAGGAGGCGGAGGTTCAATCCAGAAAACCCCTCAAATTCAAGTATA 
CTCACGCCACCCACCGGAGAATGGGAAGCCGAACATACTGAACTGCTACGTAA 
-GAGAGTTeeACCCGCCTCACATTGAAATCCASSTGCTGAAGAACGGGAAAAAA 
ATTCCTAAAGTAGAGATGTCAGATATGTCCTTCAGCAAGGACTGGTCTTTCTA 
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TATCCTGGCTCACACTGAATTCACCCCCACTGAGACTGATACATAC6CCTGCA 
GAGTTAAGCATGACAGTATGGCCGAGCCCAAGACCGTCTACTGGGATCGAGAC 
ATGGAACAAAAGCTTATTTCTGAAGAAGACTTGCATATGCAGGAACTGACAAC 
TATATGCGAGCAAATCCCCTCACCAACTTTAGAATCGACGCCGTACTCTTTGT 
CAACGACTACTATTTTGGCCAACGGGAAGGCAATGCAAGGAGTTTTTGAATAT 
TACAAATCAGTAACGTTTGTCAGTAATTGCGGTTCTCACCCCTCAACAACTAG 
CAAAGGCAGCCCCATAAACACACAGTATGTTTTTAAGGACAATAGCTCGACGA 
TTGAAGGTAGATACCCATACGACGTTCCAGACTACGCTTAGTAACTCGAG 



Amino Acid Sequence 2: SS-SIYR/K b /p2m-c-wvc-AGA2 (SEQ ID NO:25) 

ILLHTFS IKMQLLRCF S I FS VI AS VLAASLDKRS I YRYYGLLKGGPGGGS GGG 

GPHSLRYFVTAVSRPGLGEPRYMEVGYVDDTEFVRFDSDAENPRYEPRARWME 

QEGPEYWERETQKAKGNEQSFRVDLRTLLGYYNQSKGGSHTIQVISGCEVGSD 

GRLLRGYQQYAYDGCDYIALNEDLKTWT2yUDMAALITKHKWEQAGEAERLRAY 

LEGTCVEWLRRYLKNGNATLLRTDSPKAHVTHHSRPEDKVTLRCWALGFYPAD 

ITLTWQLNGEELIQDMELVETRPAGDGTFQKWASVWPLGKEQYYTCHVYHQG 

LPEPLTLRWEGGIGSGGGGSGGGGSIQKTPQIQVYSRHPPENGKPNILNCYVT 

QFHPPHIEIQMLKNGKKIPKVEMSDMSFSKDWSFYILAHTEFTPTETDTYACR 

VKHDSMAEPKTVYWDRDMEQKLISEEDLHMQELTTICEQIPSPTLESTPYSLS 

TTTILANGKAMQGVFEYYKSVTFVSNCGSHPSTTSKGSPINTQYVFKDNSSTI 
EGRYPYDVPDYA 



Nucleic Acid Sequence 3: SS-dEV8/K"/p2m-c-OTyc-AGA2 (SEQ ID NO:26) 

■ 

GAATTCTACTTCATACATTTTCAATTAAGATGCAGTTACTTCGCTGTTTTTCA 
ATATTTTCTGTTATTGCTTCAGTTTTAGCAGCTAGCTTGGATAAAAGAGAACA 
ATACAAATTCTACTCAGTTCTTAAGGGTGGACCAGGTGGAGGTTCAGGAGGTG 
GAGGCCCACACTCGCTGAGGTATTTCGTCACCGCCGTGTCCCGGCCCGGCCTC 
GGGGAGCCCCGGTACATGGAAGTCGGCTACGTGGACGACACGGAGTTCGTGCG 
CTTCGACAGCGACGCGGAGAATCCGAGATATGAGCCGCGGGCGCGGTGGATGG 
AGCAGGAGGGGCCCGAGTATTGGGAGCGGGAGACACAGAAAGCCAAGGGCAAT 
GAGCAGAGTTTCCGAGTGGACCTGAGGACCCTGCTCGGCTACTACAACCAGAG 
CAAGGGCGGCTCTCACACTATTCAGGTGATCTCTGGCTGTGAAGTGGGGTCCG 
ACGGGCGACTCCTCCGCGGGTACCAGCAGTACGCCTACGACGGCTGCGATTAC 
ATCGCCCTGAACGAAGACCTGAAAACGTGGACGGCGGCGGACATGGCGGCGCT 
— GATGACGAAAGAGAAGTGGGAGCAGGCTGGTGAAGCAGAGAGACTCAGGGCCT 
ACCTGGAGGGCACGTGCGTGGAGTGGCTCCGCAGATACCTGAAGAACGGGAAC 
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GC6ACGCTGCTGCGCACAGATTCCCCAAAGGCCCATGTGACCCATCACAGCAG 

ACCTGAAGATAAAGTCACCCTGAGGTGCTGGGCCCTGGGCTTCTACCCTGCTG 

ACATCACCCTGACCTGGCAGTTGAATGGGGAGGAGCTGATCCAGGACATGGAG 

CTTGTGGAGACCAGGCCTGCAGGGGATGGAACCTTCCAGAAGTGGGCATCTGT 

GGTGGTGCCTCTTGGGAAGGAGCAGTATTACACATGCCATGTGTACCATCAGG 

GGCTGCCTGAGCCCCTCACCCTGAGATGGGAGGGTGGAATAGGTTCAGGTGGC 

GGTGGATCAGGAGGCGGAGGTTCAATCCAGAAAACCCCTCAAATTCAAGTATA 

CTCACGCCACCCACCGGAGAATGGGAAGCCGAACATACTGAACTGCTACGTAA 

CACAGTTCCACCCGCCTCACATTGAAATCCAAATGCTGAAGAACGGGAAAAAA 

ATTCCTAAAGTAGAGATGTCAGATATGTCCTTCAGCAAGGACTGGTCTTTCTA 

TATCCTGGCTCACACTGAATTCACCCCCACTGAGACTGATACATACGCCTGCA 

GAGTTAAGCATGACAGTATGGCCGAGCCCAAGACCGTCTACTGGGATCGAGAC 

ATGGAACAAAAGCTTATTTCTGAAGAAGACTTGCATATGCAGGAACTGACAAC 

TATATGCGAGCAAATCCCCTCACCAACTTTAGAATCGACGCCGTACTCTTTGT 

CAACGACTACTATTTTGGCCAACGGGAAGGCAATGCAAGGAGTTTTTGAATAT 

TACAAATCAGTAACGTTTGTCAGTAATTGCGGTTCTCACCCCTCAACAACTAG 

CAAAGGCAGCCCCATAAACACACAGTATGTTTTTAAGGAC2iATAGCTCGACGA 

TTGAAGGTAGATACCCATACGACGTTCCAGACTACGCTTAGTAACTCGAG 



Amino Acid Sequence 3: SS-dEV8/K b /p2m-c-myc-AGA2 (SEQ ID NO:27) 

ILLHTFSIKMQLLRCPSIFSVIASVLAASLDKREQYKFYSVLKGGPGGGSGGG 

G PHSLRYFVTAVSRPGLGEPRYMBVGYVDDTEFVRFDSDAENPRY E PRARWM E 

QEGPEYWERETQKAKGNEQSFRVDLRTLLGYYNQSKGGSHTIQVISGCEVGSD 

GRLLRGYQQYAYDGCDYIALNEDLKTWTAADMAALITKHKWEQAGEAERLRAy 

LEGTCVEWLRRYLKNGNATLLRTDSPKAHVTHHSRPEDKVTLRCWALGFYPAD 

ITLTWQLNGEELIQDMELVETRPAGDGTFQKWASWVPLGKEQYYTCHVYHQG 

LPEPLTLRWEGGIGSGGGGSGGGGSIQKTPQIQVYSRHPPENGKPNILNCYVT 

QFHPPHIEIQMLKNGKKIPKVEMSDMSFSKDWSFYIIAHTEFTPTETDTYACR 

VKHDSMAEPKTVYWDRDMEQKLISEEDLHMQELTTICEQIPSPTLESTPYSLS 

TTTILANGKAMQGVFEYYKSVTFVSNCGSHPSTTSKGSPINTQYVFKDNSSTI 
EGRYPYDVPDYA 



Nucleic Acid Sequence 4: SS-OVA/K b /p2m-c-myc-AGA2 (SEQ ID NO: 28) 

GAATTCTACTTCATACATTTTCAATTAAGATGCAGTTACTTCGCTGTTTTTCA 
ATATT-T-TCTGTTATIGCCTCAGT 

CATCAATTTTGAAAAGCTTCTTAAGGGTGGACCAGGTGGAGGTTCAGGAGGTG 
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GAGGCCCACACTCGCTGAGGTATTTCGTCACCGCCGTGTCCCGGCCCG6CCTC 

GGGGAGCCCCGGTACATGGAAGTCGGCTACGTGGACGACACGGAGTTCGTGCG 

CTTCGACAGCGACGCGGAGAATCCGAGATATGAGCCGCGGGCGCGGTGGATGG 

AGCAGGAGGGGCCCGAGTATTGGGAGCGGGAGACACAGAAAGCCAAGGGCAAT 

GAGCAGAGTTTCCGAGTGGACCTGAGGACCCTGCTCGGCTACTACAACCAGAG 

CAAGGGCGGCTCTCACACTATTCAGGTGATCTCTGGCTGTGAAGTGGGGTCCG 

ACGGGCGACTCCTCCGCGGGTACCAGCAGTACGCCTACGACGGCTGCGATTAC 

ATCGCCCTGAACGAAGACCTGAAAACGTGGACGGCGGCGGACATGGCGGCGCT 

GATCACCAAACACAAGTGGGAGCAGGCTGGTGAAGCAGAGAGACTCAGGGCCT 

ACCTGGAGGGCACGTGCGTGGAGTGGCTCCGCAGATACCTGAAGAACGGGAAC 

GCGACGCTGCTGCGCACAGATTCCCCAAAGGCCCATGTGACCCATCACAGCAG 

ACCTGAAGATAAAGTCACCCTGAGGTGCTGGGCCCTGGGCTTCTACCCTGCTG 

ACATCACCCTGACCTGGCAGTTGAATGGGGAGGAGCTGATCCAGGACATGGAG 

CTTGTGGAGACCAGGCCTGCAGGGGATGGAACCTTCCAGAAGTGGGCATCTGT 

GGTGGTGCCTCTTGGGAAGGAGCAGTATTACACATGCCATGTGTACCATCAGG 

GGCTGCCTGAGCCCCTCACCCTGAGATGGGAGGGTGGAATAGGTTCAGGTGGC 

GGTGGATCAGGAGGCGGAGGTTCAATCCAGAAAACCCCTCAAATTCAAGTATA 

CTCACGCCACCCACCGGAGAATGGGAAGCCGAACATACTGAACTGCTACGTAA 

CAC^GTTCCACCCGCCTCACATTGAAATCCAAATGCTGAAGAACGGGAAAAAA 

ATTCCTAAAGTAGAGATGTCAGATATGTCCTTCAGCAAGGACTGGTCTTTCTA 

TATCCTGGCTCACACTGAATTCACCCCCACTGAGACTGATACATACGCCTGCA 

GAGTTAAGCATGACAGTATGGCCGAGCCCAAGACCGTCTACTGGGATCGAGAC 
ATGGAACAAAAGCTTATTTCTGAAGAAGACTTGCATATGCAGGAACTGAGAAr 



TATATGCGAGCAAATCCCCTCACCAACTTTAGAATCGACGCCGTACTCTTTGT 
CAACGACTACTATTTTGGCCAACGGGAAGGCAATGCAAGGAGTTTTTGAATAT 
TACAAATCAGTAACGTTTGTCAGTAATTGCGGTTCTCACCCCTCAACAACTAG 
CAAAGGCAGCCCCATAAACACACAGTATGTTTTTAAGGACAATAGCTCGACGA 
TTGAAGGTAGATACCCATACGACGTTCCAGACTACGCTTAGTAACTCGAG 



Amino Acid Sequence 4: SS-OVA/K b /p2m-c-OTyc-AGA2 (SEQ ID NO:29) 

ILLHTPS IKMQLLRCPS IPSVIASVLAASLDKRS I INFEKLLKGGPGGGSGGG 
GPHSLRYFVTAVSRPGLGEPRyMEVGYVDDTEFVRFDSDAENPRYEPRARWME 
QEGPEYWERETQKAKGNEQSFRVDLRTLLGYYNQSKGGSHTIQVISGCEVGSD 
GRLLRGYQQYAYDGCDYIALNEDLKTWTAADMAALITKHKWEQAGEAERLRAY 
LEGTCVEWLRRYLKNGNATLLRTDSPKAHVTHHSRPEDKVTLRCWALGFYPAD 
ITLTWQLNGEELIQDMELVETR PAGDGTFQKWASVWPLGKEQYYTCHVYHQG 
LPisPLTLRWEGGIGSGGGGSGGGGSIQKTPQIQVYSRHPPENGKPNILNCYVT 
QFHPPHIEIQMLKNGKKIPKVEMSDMSFSKDWSPYILAHTEFTPETDTYACRV 
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KHDSMAEPKTVYWDRDMEQKLISEEDLHMQELTTICEQIPSPTLESTPYSLST 

TTII^GK^QGVPEYYKSVTPVSNCGSHPSTTSKGSPINTQYVFKDNSSTIE 
GRYPYDVPDYA AB 



10 



15 



20 



25 



30 



35 



Nucleic Acid Sequence 5: SS-AGA2-L d /p2m-c-wyc (SEQ ID NO:30) 




'TTTAGC 

AC^GGAACTGACAACTATATGCGAGCAAATCCCCTCACCAACTTTAGAATCGA 
CGCCGTACTCTTTGTCAACGACTACTATTTTGGCCAACGGGAAGGCAATGCAA 
GGAGTTTTTGAATATTACAAATCAGTAACGTTTGTCAGTAATTGCGGTTCTCA 
CCCCTCAACAACTAGCAAAGGGAGCCCCATAAft.CACACAGTATGTTTTTAAGG 
ACAATAGCTCGACGATTGAAGGTAGATACCCATACGACGTTCCAGACTACGCT 
CTGCAGGCTAGTGGTGGTGGTGGTTCGGTGGTGGTGGTTCTGGTGGTGGTGGT 
TCTGCTAGCGGTGGACTTAAGGGTGGACCAGGTGGAGGTTCAGGAGGTGGAGG 
CCCACACTCGATGCGGTATTTCGAGACCGCGGTGTCCCGGCGCGGCCTCGGGG 
AGCCCCGGTACATCTCTGTCGGCTATGTGAACGACAAGGAGTTCGTGCGCTTC 
GACAGCGACGCGGAGAATCCGAGATATGAGCCGAGGGCGCCGTGGATGGAGCA 
GGAGGGGCCGGAGTATTGGGAGCGGATCACGCAGATCGCCAAGGGCCAGGAGC 
AGTGGTTCCGAGTGAACCTGAGGACCCTGCTCGGCTACTACAACCAGAGCGCG 
GGCGGCACTCACACACTCCAGTGGATGTACGGCTGTGACGTGGGGTCGGACGG 
GCGCCTCCTCCGCGGGTACGAGCAGTTCGCCTACGACGGCTGCGATTACATCG 
CCCTGAACGAAGACCTGAAAACGTGGACGTTCGCGGACATGTCGTCGATGATC 
AOUCWACGCAAGTGGGAGCAGGCTGGTGCTGCAGAGTATTACAGGGCCTACCT 
GGAGGGCGAGTGCGTGGAGTGGCTCCACAGATACCTGAAGAACGGGAATGCTA 
CGCTGCTGCGCACAGATTCCCCAAAGGCACATGTGACCTATCACCCCAGATCT 
AAAGGTGAAGTCACCCTGAGGTGCTGGGCCCTGGGCTTCTACCCTGCTGACAT 
CACCCTGACCTGGCAGTTGAATGGGGAGGAGCTGACCCAGGACATGGAGCTTG 
TGGAGACCAGGCCTGCAGGGGATGGAACCTTCCAGAAGTGGGCATCTGTGGTG 
GTGCCTCTTGGGAAGGAGCAGAATTACACATGCCGTGTGTACCATGAGGGGCT 
GCCCCATCCCCTCACCCTGAGATGGGAGGGTGGAATAGGTTCAGGTGGCGGTG 
GATCAGGAGGCGGAGGTTCAATCCAGAAAACCCCTCAAATTCAAGTATACTCA 
CGCCACCCACCGGAGAATGGGAAGCCGAACATACTGAACTGCTACGTAACACA 
GTTCCACCCGCCTCACATTGAAATCCAAATGCTGAAGAACGGGAAAAAAATTC 
CTAAAGTAGAGATGTCAGATATGTCCTTCAGCAAGGACTGGTCTTTCTATATC 
CTGGCTCACACTGAATTCACCCCCACTGAGACTGATACATACGCCTGCAGAGT 
TAAGCATGACAGTATGGCCGAGCCCAAGACCGTCTACTGGGATCGAGACATGG 
AACAAAAGCTTATTTCTGAAGAAGACTTGTAATAGCTCGAG 



40 



39 



WO 02/46399 



PCT/US01/47817 



Amino Acid Sequence 5: SS-AGA2-L d /p2m-c-/nyc (SEQ ID NO:31) 

MQLLRCFSIPSVIASVLAQELTTICEQIPSPTLESTPYSLSTTTILANGKAMQ 

GVPEYYKSVTPVSNCGSHPSTTSKGSPINTQYVFKDNSSTIEGRYPYDVPDYA 

LQASGGGGSGGGGSGGGGSASGGLKGGPGGGSGGGGPHSMRYFETAVSRRGLG 

EPRYISVGYVNDKEPVRPDSDAENPRYEPRAPWMEQEGPEYWERITQIAKGQE 

QWFRVNLRTLLGYYNQSAGGTHTLQWMYGCDVGSDGRLLRGYEQPAYDGCDYI 

ALNEDLKTWTFADMSSMITRRKWEQAGAAEYYRAYLEGECVEWLHRYLKNGNA 

TLLRTDSPKAHVTYHPRSKGEVTLRCWALGPYPADITLTWQLNGEELTQDMEL 

VETRPAGDGTFQKWASVWPLGKEQNYTCRVYHEGLPHPLTLRWEGGIGSGGG 

GSGGGGSIQKTPQIQVYSRHPPENGKPNILNCYVTQPHPPHIEIQMLKNGKKI 

PKVEMSDMSPSKDWSFYILAHTEFTPTETDTYACRVKHDSMAEPKTVYWDRDM 
EQKLISEEDL 



Table 3. 

Sequences of mutant clones isolated by sorting from 
dEV8/Kb error-prone PCR library. 

Nucleic Acid Sequence 1: dEP.l (SEQ ID NO:32) 

GAATTCTACTTCATACATTTTCAATTAAGATGCAGTTACTTCGCTGTTTTTCA 

ATATTTTCTGTTATTGCTTCAGTTTTAGCAGCTAGCTTGGATAAAAGAGAACA 
ATACAAATTCTACTCAGTTCTTAAGGGTGGACCAGGTGGAGGTTCAGgAGGTO 

GAGGCCCACACTCGCTGAGGTATTTCGTCACCGCCGTGTCCCGGCCCGGCCTC 
GGGGAGCCCCGGTACATGGAAGTCGGCTACGTGGACGACACGGAGTTCGTGCG 
CTTCGACAGCGACGCGGAGAATCCGAGATATGAGCCGCGGGCGCGGTGGATGG 
AGCAGGAGGGGCCCGAGTATTGGGAGCGGGAGACACAGAAAGCCAAGGGCAAT 
GAGCAGAGTTTCCGAGTGGACCTGAGGACCCTGCTCGGCTACTACAACCAGAG 
CAAGGGCGGCTCTCACACTATTCAGGTGATCTCTGGCTGTGAAGTGGGGTCCG 
ACGGGCGACTCCTCCGCGGGTACCAGCAGTACGCCTACGACGGCTGCGATTAC 
ATCGCCCTGAACGAAGACCTGAAAACGTGGACGGCGGCGGACATGGCGGCGCT 
GATCACCAAACACAAGTGGGAGCAGGCTGGTGAAGCAGAGAGACTCAGGGCCT 
ACCTGGAGGGCACGTGCGTGGAGAGGCTCCGCAGATACCTGAAGAACGGGAAC 
GCGACGCTGCTGCGCACAGATTCCCCAAAGGCCCATGTGACCCATCACAGCAG 
ACCTGAAGATAAAGTCACCCTGAGGTGCTGGGCCCTGGGCTTCTACCCTGCTG 
ACATCACCCTGACCTGGCAGTTGAATGGGGAGGAGCT 
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Amino Acid Sequence 1: dEP.l (SEQ ID NO:33) 

ILLHTFSIBCMQLLRCPSIFSVIASVLAASLDKREQYKPYSVLKGGPGGGSGGG 
GPHSLRYFVTAVSRPGLGEPRYMEVGYVDDTEFVRFDSDAENPRYEPRARWME 
QEGPEYWERETQKAKGNEQSFRVDLRTLLGYYNQSKGGSHTIQVISGCEVGSD 
GRLLRGYQQYAYDGCDYIALNEDLKTWTAADMAALITKHKWEQAGEAERLRAY 
LEGTCVERLRRYLKNGNATLLRTDSPKAHVTHHSRPEDKVTLRCWALGFYPAD 
ITLTWQLNGEELIQDMELVETRPAGDGTFQKWASWVPLGKEQYYTCHVYHQG 
LPEPLTLRWEGGIGSGGGGSGGGGSIQRTPQIQVYSRHPPENGKPNILNCYVT 
QFHPPHI E IQMLKNGKKI PKVEMSDMSFSKDWS FNI LAHTEFTPTETDTYACR 

VKHDSMAEPKTVYWDRDMEQRLISEEDLHMQELTTICEQIPSPTLESTPYSLS 

TTTILANGKAMQGVFEYYKSVTFVSNCGSHPSTTSKGSPINTQYVFKDNSSTI 
EGRYPYDVPDYA 



Nucleic Acid Sequence 2: dEP.3 (SEQ ID NO:34) 

GAATTCTACTTCATACATTTTCAATTAAGATGCAGTTACTTCGCTGTTTTTCA 

ATATTTTCTGTTATTGCTTCAGTTTTAGCAGCTAGCTTGGATAAAAGAGAACA 

ATACAAATTCTACTCAGTTCTTAAGGGTGGACCAGGTGGAGGTTCAGGAGGTG 

GAGGCCCACACTCGCTGAGGTATTTCGTCACCGCCGTGTCCCGGCCCGGCCTC 

GGGGAGCCCCGGTACATGGAAGTCGGCTACGTGGACGACACGGAGTTCGTGCG 

CTTCGACAGCGACGCGGAGAATCCGAGATATGAGCCGCGGGCGCGGTGGATGG 

AGCAGG AQ QGGCCCGAGTATTGGGAGCGGGAGACACAGAAAGCCAAGGGCAAT 

GAGCAGAGTTTCCGAGTGGACCTGAGGACCCTGCTCGGCTACTACAACCAGAG 

CAAGGGCGGCTCTCACACTATTCAGGTGATCTCTGGCTGTGAAGTGGGGTCCG 

ACGGGCGACTCCTCCGCGGGTACCAGCAGTACGCCTACGACGGCTGCGATTAC 

ATCGCCCTGAACGAAGACCTGAAAACGTGGACGGCGGCGGACATGGCGGCGCT 

GATCACCAAACACAAGTGGGAGCAGGCTGGTGAAGCAGAGAGACTCAGGGCCT 

ACCTGGAGGGCACGTGCGTGGAGCGGCTCCGCAGATACCTGAAGAACGGGAAC 
GCGACGCTGCTGCGCACAGATTCCCCAAAGG 



Amino Acid Sequence 2: dEP.3 (SEQ ID NO:35) 

ILLHTFSIKMQLLRCFSIFSVIASVLAASLDKREQYKFYSVLKGGPGGGSGGG 

GPHSLRYFVTAVSRPGLGEPRYMEVGYVDDTEFVRFDSDAENPRYEPRARWME 

QEGPEYWERETQKAKGNEQSFRVDLRTLLGYYNQSKGGSHTIQVISGCEVGSD 

GRLLRGYQQYAYDGCDYIALNEDLKTVraAADMAJ^-ITKHKWEQAGEAERLRA 

LEGTCVERLRRYLKNGNATLLRTDSPKAHVTHHSRPEDKVTLRCWALGFYPAD 
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ITLTWQLNGEELIQDMELVETRPAGDGTPQKWASVWPLGKEQYYTCHVYHQG 

LPEPLTLRWEGGIGSGGGGSGGGGSIQKTPQIQVYSRHPPENGKPNILNCYVT 

QPHPPHIEIQMLKNGKKIPKVEMSDMSFSKDWSPYILAHTEFTPTETDTYACR 

VKHDSMAEPKTVYPJDRDMEQKLISEEDLHMQELTTICEQIPSPTLESTPYSLS 

TTTILANGKAMQGVFEYYKSVTFVSNCGSHPSTTSKGSPINTQYVFKDNSSTI 
EGRYPYDVPDYA 



Nucleic Acid Sequence 3: dEP.4 (SEQ ID NO:36) 

GAATTCTACTTCATACATTTTCAATTAAGATGCAGTTACTTCGCTGTTTTTCA 
ATATTTTCTGTTATTGCTTCAGTTTTAGCAGCTAGCTTGGATAAAAGAGAACA 
ATACAAATTCTACTCAGTTCTTAAGGGTGGACCAGGTGGAGGTTCAGGAGGTG 
GAGGCCCACACTCGCTGAGGTATTTCGTCACCGCCGTGTCCCGGCCCGGCCTC 
GGGGAGCCCCGGTACATGGAAGTCGGCTACGTGGACGACACGGAGTTCGTGCG 
CTTCAACAGCGACGCGGAGAATCCGAGATATGAGCCGCGGGCGCGGTGGATGG 
AGCAGGAGGGGCCCGAGTATTGGGAGCGGGAGACACAGAAAGCCAAGGGCAAT 
GAGCAGAGTTTCCGAGTGGACCTGAGGACCCTGCTCGGCTACTACAACCAGAG 
CAAGGGCGGCTCTCACACTATTCAGGTGATCTCTGGCTGTGAAGTGGGGTCCG 
ACGGGCGACTCCTCCGCGGGTACCAGCAGTACGCCTACGACGGCTGCGATTAC 
ATCGCCCTGAACGAAGACCTGAAAACGTGGACGGCGGCGGACATGGCGGCGCT 
GATCACCAAACACAAGTGGGAGCAGGCTGGTGAAGCAGAGAGACTCAGGGCCT 
ACCTGGAGGGCACGTGCGTGGAGAGGCTCCGCAGATACCTGAAGAACGGGAAC 
GCGACGCTGCTGCGCACAGATTCCCCAAAGGCCCATGTGACCCATCACAGCAG 
ACCTGAAGATAAAGTCACCCTGAGGTGCTGGGCCCTGGGCTTCTACCCTGCTG 
ACATCACCCTGACCTGGCAGTTGAATGGGGAGGAGCTGATCCAGGACATGGAG 
CTTGTGGAGACCAGGCCTGCAGGGGATGGAACCTTCCAGAAGTGGGCATCTGT 
GGTGGTGCCTCTTGGGAAGGAGCAGTATTACACATGCCATGTGTACCATCAGG 
GGCTGCCTGAGCCCCTCACCCTGAGATGGGAGGGTGGAATAGGTTCAGGTGGC 

GGTGGATCAGGAGGCGGAGGTTCAATCCAGAAAACCCCTCAAATTCAAGTATA 
CTCACGCCACCCACCGGAGAATGGGAAGCCG 



Amino Acid Sequence 3: dEP.4 (SEQ ED NO:37) 

ILLHTFSIKMQLLRCFSIFSVIASVLAASLDKREQYKFYSVLKGGPGGGSGGG 
GPHSLRYFVTAVSRPGLGEPRYMEVGYVDDTEFVRFNSDAENPRYEPRARWME 
QEGPEYWERETQKAKGNEQSFRVDLRTLLGYYNQSKGGSHTIQVISGCEVGSD 
GRttR'GYQQYAYD^SCDYIALNEDLKTWTAADMA^ 

LEGTCVERLRRYLKNGNATLLRTDSPKAHVTHHSRPEDKVTLRCWALGFYPAD 
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ITLTWQLNGEELIQDMELVETRPAGDGTFQKWASWVPLGKEQYYTCHVYHQ6 

LPEPLTLRWBGGIGSGGGGSGGGGSIQKTPQIQVYSRHPPENGKPNILNCYVT 

QPHPPHIEIQMLKNGKKIPKVEMSDMSFSKDWSPNILAHTEFTPTETDTYACR 

VKHDSMAEPKTVYWDRDMEQRLISEEDLHMQELTTICEQIPSPTLESTPYSLS 

TTTILANGKAMQGVFEYYKSVTPVSNCGSHPSTTSKGSPINTQYVFKDNSSTI 
EGRYPYDVPDYA 
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WHAT IS CLAIMED IS: 

1. A mutagenized combinatorial library of Major Histocompatibility Complex (MHC) 
Class I chimeric proteins displayed on the surfaces of recombinant yeast cells, wherein 
the mutagenized combinatorial library comprises at least one member MHC Class I 
chimeric protein which is improved in conformational stability or in specific target 
binding as compared with a comparison MHC Class I chimeric protein which has not 
been mutagenized. 

2. The mutagenized combinatorial library of claim 1 wherein the MHC Class I chimeric 
protein comprises a portion mediating binding to the surfaces of the recombinant yeast 
cells and a portion which comprises a specific target binding region of a MHC Class 
I protein. 

3 . The mutagenized combinatorial library of claim 2 wherein the portion mediated binding 
to the surfaces of the recombinant yeast cells is a mating adhesion receptor portion. 

4. The mutagenized combinatorial library of claim 3 wherein the mating adhesion receptor 
portion is an AGA2 portion. 



5 . The mutagenized combinatorial library of any of claims 2 to 4 wherein the chimeric 
protein further comprises a portion characterized by an amino acid sequence of a 
peptide which binds to the binding region of the MHC Class I chimeric protein. 

6. The mutagenized combinatorial library of any of claims 2 to 5 wherein the chimeric 
protein further comprises a portion derived from a c-myc protein and which mediates 
binding to a c-myc specific antibody. 

7. The mutagenized combinatorial library of any of claims 2 to 6 wherein the binding 
region of the MHC Class I chimeric protein specifically binds a specific target selected 
from the group consisting of a neoplastic cell, a virus-infected cell, a fungus-infected 
cell, a parasite-infected cell and a bacterium-infected cell. 
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8. The mutagenized combinatorial library of claim 8 wherein the peptide binding region 
specifically binds a peptide having the amino acid sequence given in SEQ ID NO: 19, 
SEQ ID NO:22 or SEQ ID NO:24. 

9. The mutagenized combinatorial library of claim 8 wherein said chimeric protein 
comprises an amino acid sequence as given in SEQ ID NO: 17. 

10. An isolated mutant MHC Class I chimeric protein, wherein said protein comprises a 
portion mediating binding to the surfaces of the recombinant yeast cells and a portion 
which comprises a peptide binding region of a MHC Class I protein and wherein said 
chimeric protein is improved in stability as compared with an MHC Class I chimeric 
protein which is not a mutant chimeric protein. 

11. The isolated mutant MHC Class I chimeric protein of claim 10 wherein the chimeric 
protein further comprises a portion comprising an amino acid sequence of a peptide 
which binds to the peptide binding region of the MHC Class I protein. 

12. The isolated mutant MHC Class I chimeric protein of claim 10 wherein a peptide 
which binds to the peptide binding region of the MHC Class I protein is associated with 
a neoplasti c or infe c tious disease. 

■ 

13. The isolated mutant MHC Class I chimeric protein of claim 10 wherein the peptide 
binding region specifically binds a peptide having the amino acid sequence given in 
SEQ ID NO: 19, SEQ ID NO:22 or SEQ ID NO:24. 

14. The isolated mutant MHC Class I chimeric protein of claim 1 1 wherein said chimeric 
protein further comprises a detectable label. 

15. The isolated mutant MHC Class I chimeric protein of claim 15 wherein the detectable 
label is a fluorescent moiety, a chromophore, a radionuclide, a chemiluminescent agent, 
a magnetic particle, an enzyme, a cofactor, a substrate or a toxin. 

- 16: A _ method"for detectionof aiymphocytehaving a ^TceUreceptorproteinina'bioiogical - 

sample, said method comprising the steps of contacting the sample with an isolated 
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mutant chimeric protein of claim 14, wherein said chimeric protein is complexed to the 
peptide or wherein the chimeric protein and peptide are covalently bound, wherein said 
chimeric protein comprises a binding region which specifically binds said T cell receptor 
protein under conditions which allow the binding of the T cell receptor protein to the 
chimeric protein, and detecting the chimeric protein bound to the T cell receptor protein. 

17. The method of claim 16 wherein the T lymphocyte is specific for a neoplastic cell, a 

tumor cell, a virus-infected cell, a protozoan-infected cell, a bacterium-infected cell or 
a fungus-infected cell. 

18. The method of claim 16 or 17 wherein the biological sample is cells, a tissue sample, 
biopsy material or bodily fluids. 

19. A method for activating or enhancing an immune response to an abnormal cell selected 
from the group consisting of a neoplastic cell, a tumor cell, a virus-infected cell, a 
parasite-infected cell, a fungus infected cell or a protozoan infected cell in a human or 
animal, said method comprising die step of adnunistering to the patient a therapeutically 
effective amount of an isolated mutant MHC Class I chimeric protein or a mutant MHC 
Class I chimeric protein/peptide complex which is improved in conformational stability 

or improved in binding to T lymphocyte as compared with the MHC Class I chimeric 

protein which is not mutant, whereby the immune response in the human or animal is 
activated or enhanced. 

20. The method of claim 19 wherein the administering is by intravenous, intramuscular, 
intradermal, subcutaneous or intraperitoneal administration. 

21 . The method of claim 20 wherein said isolated mutant protein has a portion comprising 
an amino acid sequence as given in SEQ ID NO: 17. 

22. The method of claim 21 wherein said mutant protein binds a peptide comprising an 
amino acid sequence as given in SEQ ID NO:19, SEQ ID NO:22 or SEQ ID NO:24. 
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23. A combinatorial library of peptides anchored to the surface of yeast cells, each cell 
displaying a peptide of a unique sequence anchored to its surface, wherein the peptide is 
anchored to the surface of the yeast cell by a mating factor sequence. 

24. The combinatorial library of claim 23 wherein the mating factor sequence is an AGA2 
sequence. 

25. A method for identifying a peptide which binds specifically to an MHC protein 
comprising the step of contacting a detectable MHC protein with the combinatorial 
library of claim 24 or 25 under conditions which allow binding of the protein and the 
peptide, and detecting peptide bound to the MHC protein. 
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SEQUENCE LISTING 

<110> Board of Trustees of the University of Illinois 

<120> Mutated Class I Major Histocompatibility proteins and 
Complexes 

<130> 100-00WO 

<140> Not assigned 
<141> 2001-12-10 

<150> 60/254,495 
<151> 2000-12-08 

<160> 37 

<170> Patentln Ver. 2.0 

<210> 1 
<211> 8 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
peptide 

<400> 1 

Ser lie Tyr Arg Tyr Tyr Gly Leu 
1 5 



<210> 2 
<211> 8 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
peptide 

<400> 2 

Ser lie lie Asn Phe Glu Lys Leu 
1 5 



<210> 3 
<211> 63 
<212> DNA 

<213> Artificial Sequence 
<220> 

-<22 3 >- Description - of ~Arti"f i^ial - ^^ueiice : Synthetic 
nucleotide primer 
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<220> 

<221> miscjEeature 
<222> (1) . . {63) 

<223> N is A 7 T, G or C. 
<400> 3 

atattttctg ttattgcttc agttttagca gctagcttgg ataaaagann snnsrmsaaa 60 

63 

<210> 4 
<211> 9 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
peptide 

<400> 4 

Gin Leu Ser Pro Phe Pro Phe Asp Leu 
1 5 



<210> 5 
. <211> 8 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
peptide 

<400> 5 

Glu Gin Tyr Lys Phe Tyr Ser Val 
1 5 



<210> 6 
<211> 8 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
peptide 

<400> 6 

Asn Asn Asn Lys Phe Tyr Ser Val 
1 5 



<210> 7 
<211> 79 

<212> DHA 

<213> Artificial Sequence 

<220> 
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<223> Description of Artificial Sequence: Synthetic 
nucleotide primer 

<400> 7 

caatggctag cggtggactt aagggtggac caggtggagg ttcaggaggt ggaggcccac 60 
actcgctgag gtatttcgt 

79 

<210> 8 
<211> 72 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
nucleotide primer 



<400> 8 

tgaacctccg cctcctgatc caccgccacc tgaacctatt ccaccctccc atctcagggt 60 

72 



gaggggctca 99 



<210> 9 
<211> 73 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
nucleotide primer 

<400> 9 

ggtggaatag gttcaggtgg cggtggatca ggaggcggag gttcaatcca gaaaacccct 60 
caaattcaag tat 73 



<210> 10 
<211> 71 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
nucleotide primer 

<400> 10 

gttccctcga gctattacaa gtcttcttca gaaataagct tttgttccat gtctcgatcc 60 
cagtagacgg t 71 

<210> 11 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223 > D escription o f Artific ial S equence: S ynthetic 

nucleotide primer 



<400> 11 

tctcaagaat tctacttcat acatttt 



27 
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27 



<210> 12 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
nucleotide primer 

<400> 12 

gtatctgcta gctgctaaaa ctgaagc 
<210> 13 

<211> 72 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
nucleotide primer 

<400> 13 

atactagcta gcttggataa aaggtctatt tatagatatt atggtttgct taagggtgga 60 
ccaggtggag gt ~ ?2 

<210> 14 
<211> 42 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 

nucleotide primer 



<400> 14 

caatccagat ctttactaat gcaagtcttc ttcagaaata ag 

<210> 15 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
nucleotide primer 

<400> 15 

ggatatcata tgcaggaact gacaactata 

<210> 16 

<211> 43 

<212> DNA 

<213> Artificial Sequence 



42 



30 



<220> 

<223> Description of Artificial Sequence: Synthetic 
nucleotide primer 
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<400> 16 

atttgcagat ctcgagttac taagcgtagt ctggaacgtc gta 4 - 

<210> 17 
<2U> 42 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
nucleotide primer 

<400> 17 

ctagcttgga taaaaggagc atcatcaatt ttgaaaagct tc 42 

<210> 18 
<211> 42 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
nucleotide primer 

<400> 18 

ttaagaagct tttcaaaatt gatgatgctc cttttatcca ag 42 

<210> 19 

<211> 42 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
nucleotide primer 

<400> 19 

ctagcttgga taaaagggaa caatacaaat tctactcagt tc 42 

<210> 20 
<211> 42 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
nucleotide primer 

<400> 20 

ttaagaactg agtagaattt gtattgttcc cttttatcca ag 42 
<210> 21 

<211>_1632 . 

<212> DNA 

<213> Artificial Sequence 
<220> 
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<223> Description of Artificial Sequence: Synthetic 
nucleotide 

<400> 21 

atgcagttac ttcgctgttt ttcaatattt tctgttattg cttcagtttt agcacaggaa 60 
ctgacaacta tatgcgagca aatcccctca ccaactttag aatcgacgcc gtactctttg 120 
tcaacgacta ctattttggc caacgggaag gcaatgcaag gagtttttga atattacaaa 180 
tcagtaacgt ttgtcagtaa ttgcggttct cacccctcaa caactagcaa aggcagcccc 240 
ataaacacac agtatgtttt taaggacaat agctcgacga ttgaaggtag atacccatac 300 
gacgttccag actacgctct gcaggctagt ggtggtggtg gttctggtgg tggtggttct 360 
OTtggtggtg gttctgctag cggtggactt aagggtggac caggtggagg ttcaggaggt 420 
ggaggcccac actcgctgag gtatttcgtc accgccgtgt cccggcccgg cctcggggag 48 0 
ccccggtaca tggaagtcgg ctacgtggac gacacggagt tcgtgcgctt cgacagcgac 540 
gcggagaatc cgagatatga gccgcgggcg cggtggatgg agcaggaggg gcccgagtat 600 
fc 9ggagcggg agacacagaa agccaagggc aatgagcaga gtttccgagt ggacctgagg 660 
accctgctcg gctactacaa ccagagcaag ggcggctctc acactattca ggtgatctct 720 
ggctgtgaag tggggtccga cgggcgactc ctccgcgggt accagcagta cgcctacgac 780 
ggctgcgatt acatcgccct gaacgaagac ctgaaaacgt ggacggcggc ggacatggcg 840 
gcgctgatca ccaaacacaa gtgggagcag gctggtgaag cagagagact cagggcctac 900 
ctggagggca cgtgcgtgga gtggctccgc agatacctga agaacgggaa cgcgacgctg 960 
ctgcgcacag attccccaaa ggcccatgtg acccatcaca gcagacctga agataaagtc 1020 
accctgaggt gctgggccct gggcttctac cctgctgaca tcaccctgac ctggcagttg 1080 
aatggggagg agctgatcca ggacatggag cttgtggaga ccaggcctgc aggggatgga 1140 
accttccaga agtgggcatc tgtggtggtg cctcttggga aggagcagta ttacacatgc 1200 
catgtgtacc atcaggggct gcctgagccc ctcaccctga gatgggaggg tggaataggt 1260 
tcaggtggcg gtggatcagg aggcggaggt tcaatccaga aaacccctca aattcaagta 1320 
tactcacgcc acccaccgga gaatgggaag ccgaacatac tgaactgcta cgtaacacag 1380 
ttccacccgc ctcacattga aatccaaatg ctgaagaacg ggaaaaaaat tcctaaagta 1440 
gagatgtcag atatgtcctt cagcaaggac tggtctttct atatcctggc tcacactgaa 1500 
ttcaccccca ctgagactga tacatacgcc tgcagagtta agcatgacag tatggccgag 1560 
cccaagaccg tctactggga tcgagacatg gaacaaaagc ttatttctga agaagacttg 1620 
fc ic ta t a g *c t eg - — ac ^ , , „ , „ , , , „ , *i 



<210> 22 
<211> 540 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
peptide 

<400> 22 

Met Gin Leu Leu Arg Cys Phe Ser lie Phe Ser Val He Ala Ser Val 
15 10 15 

Leu Ala Gin Glu Leu Thr Thr He Cys Glu Gin He Pro Ser Pro Thr 

20 25 30 

Leu Glu Ser Thr Pro Tyr Ser Leu Ser Thr Thr Thr He Leu Ala Asn 
35 40 45 

Gly— Ly s-Ala -Me t-Gln-Gly— Va-1— Phe-Glu-Tyr— Tyr- -Lys-Ser- Val- -Thr- -Phe - 
50 55 60 

Val Ser Asn Cys Gly Ser His Pro Ser Thr Thr Ser Lys Gly Ser Pro 
65 70 75 80 
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He Asn Thr Gin Tyr Val Phe Lys Asp Asn Ser Ser Thr He Glu Gly 

85 90 



95 



Arg Tyr Pro Tyr Asp Val Pro Asp Tyr Ala Leu Gin Ala Ser Gly Gly 

100 105 



110 



Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Ala Ser Gly 
115 1 



120 



125 



Gly Leu Lys Gly Gly Pro Gly Gly Gly Ser Gly Gly Gly Gly Pro His 
130 135 



140 



Ser Leu Arg Tyr Phe Val Thr Ala Val Ser Arg Pro Gly Leu Gly Glu 



150 



155 



160 



Pro Arg Tyr Met Glu Val Gly Tyr Val Asp Asp Thr Glu Phe Val Arg 

165 170 



175 



Phe Asp Ser Asp Ala Glu Asn Pro Arg Tyr Glu Pro Arg Ala Arg Trp 

180 i 8 5 



190 



Met Glu Gin Glu Gly Pro Glu Tyr Trp Glu Arg Glu Thr Gin Lys Ala 
195 200 



205 

Lys Gly Asn Glu Gin Ser Phe Arg Val Asp Leu Arg Thr Leu Leu Gly 
210 215 220 

Tyr Tyr Asn Gin Ser Lys Gly Gly Ser His Thr He Gin Val He Ser 
225 230 235 240 

Gly Cys Glu Val Gly Ser Asp Gly Arg Leu Leu Arg Gly Tyr Gin Gin 

245 250 255 



Tyr Ala Tyr Asp Gly Cys Asp Tyr He Ala Leu Asn Glu Asp Leu Lys 

260 265 270 

Thr Trp Thr Ala Ala Asp Met Ala Ala Leu He Thr Lys His Lys Trp 
275 280 285 

Glu Gin Ala Gly Glu Ala Glu Arg Leu Arg Ala Tyr Leu Glu Gly Thr 
290 295 300 



Cys Val Glu Trp Leu Arg Arg Tyr Leu Lys Asn Gly Asn Ala Thr Leu 
305 310 315 320 

Leu Arg Thr Asp Ser Pro Lys Ala His Val Thr His His Ser Arg Pro 

325 330 335 

Glu Asp Lys Val Thr Leu Arg Cys Trp Ala Leu Gly Phe Tyr Pro Ala 

340 345 350 

Asp He Thr Leu Thr Trp Gin Leu Asn Gly Glu Glu Leu He Gin Asp 
355 360 365 



Met Glu Leu Val Glu Thr Arg Pro Ala Gly Asp Gly Thr Phe Gin Lys 
370 375 380 
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Trp Ala Ser Val Val Val Pro Leu Gly Lys Glu Gln ^ ^ ^ 

390 3 " 400 



His Val Tyr His G ln Gly L eu Pro Glu Pro Leu Thr Leu ^ g 

405 410 



Gly Gly lie Gly S er Gly Gly Gly Gly Ser Gly Gly Gly Gly 



425 



415 



Ser lie 



430 



Gin Lys Thr Pro Gin lie Gin Val Tyr Ser Arg His Pro Pro Glu Asn 

440 445 

Gly Lys Pro Asn lie Leu Asn Cys Tyr Val Thr Gin Phe His Pro Pro 



455 



460 



His lie Glu lie Gin Met Leu Lys Asn Gly Lys Lys He Pro 



470 



475 



Lys Val 
480 



Glu Met Ser Asp Met Ser Phe Ser Lys Asp Trp Ser Phe Tyr lie 



485 



Leu 



490 



495 



Ala His Thr Glu Phe Thr Pro Thr Glu Thr Asp Thr Tyr Ala Cys Arg 



505 



510 



Val Lys His Asp Ser Met Ala Glu Pro Lys Thr Val Tyr Trp Asp Arg 

O JL O r " 



520 



525 



Asp Met Glu Gin Lys Leu lie Ser Glu Glu Asp Leu 
530 535 



540 



<210> 23 

< 2 11> 1640 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
nucleotide 



<400> 23 
gaattctact 
ctgttattgc 
gtttgcttaa 
atttcgtcac 
acgtggacga 
cgcgggcgcg 
ccaagggcaa 
agagcaaggg 
ggcgactcct 
acgaagacct 
Srggagcaggc 
ggctccgcag 
-ceeafegtgac- 
gcttctaccc 
acatggagct 
tggtggtgcc 



tcatacattt 

ttcagtttta 

gggtggacca 

cgccgtgtcc 

cacggagttc 

gtggatggag 

tgagcagagt 

cggctctcac 

ccgcgggtac 

gaaaacgtgg 

tggtgaagca 

atacctgaag 

ccatcacagc- 

tgctgacatc 

tgtggagacc 

tcttgggaag 



tcaattaaga 

gcagctagct 

ggtggaggtt 

cggcccggcc 

gtgcgcttcg 

caggaggggc 

ttccgagtgg 

actattcagg 

cagcagtacg 

acggcggcgg 

gagagactca 

aacgggaacg 

agacctgaag" 

accctgacct 

aggcctgcag 

gagcagtatt 



tgcagttact 
tggataaaag 
caggaggtgg 

tc ggggagcc 

acagcgacgc 
ccgagtattg 
acctgaggac 
tgatctctgg 
cctacgacgg 
acatggcggc 
gggcctacct 
cgacgctgct 



tcgctgtttt 
atctatttat 
aggcccacac 
ccggtacatg 
ggagaatccg 
ggagcgggag 
cctgctcggc 
ctgtgaagtg 
ctgcgattac 
gctgatcacc 
ggagggcacg 
gcgcacagat 



ataaagtcac 
ggcagttgaa 
gggatggaac 
acacatgcca 



tcaatatttt 60 
agatattatg 120 
tcgctgaggt 180 
gaagtcggct 240 
agatatgagc 3 00 
acacagaaag 360 
tactacaacc 420 
gggtccgacg 480 
atcgccctga 540 
aaacacaagt 600 
tgcgtggagt 660 
tccccaaa g g 72 0 



cctgaggtgc 
tggggaggag 
cttccagaag 
tgtgtaccat 



tgggccctgg 780 
ctgatccagg 840 
tgggcatctg 900 
caggggctgc 960 
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ctgagcccct 
gcggaggttc 

at srggaagcc 

tccaaatgct 
gcaaggactg 
catacgcctg 
gagacatgga 
tatgcgagca 
ctattttggc 
ttgtcagtaa 
agtatgtttt 
actacgctta 



caccctgaga 

aatccagaaa 

gaacatactg 

gaagaacggg 

gtctttctat 

cagagttaag 

acaaaagctt 

aatcccctca 

caacgggaag 

ttgcggttct 

taaggacaat 

gtaactcgag 



tgggagggtg 

acccctcaaa 

aactgctacg 

aaaaaaattc 

atcotggctc 

catgacagta 

atttctgaag 

ccaactttag 

gcaatgcaag 

cacccctcaa 

agctcgacga 



gaataggttc 

ttcaagtata 

taacacagtt 

ctaaagtaga 

acactgaatt 

tggccgagcc 

aagacttgca 

aatcgacgcc 

gagtttttga 

caactagcaa 

ttgaaggtag 



aggtggcggt 
ctcacgccac 
ccacccgcct 
gatgtcagat 
cacccccact 
caagaccgtc 
tatgcaggaa 
gtactctttg 
atattacaaa 
aggcagcccc 
atacccatac 



ggatcaggag 1020 
ccaccggaga 1080 
cacattgaaa 1140 
atgtccttca 12 00 
gagactgata 1260 
tactgggatc 1320 
ctgacaacta 1380 
tcaacgacta 1440 
tcagtaacgt 1500 
a taaacacac 1560 
gacgttccag 1620 

1640 



<210> 24 
<211> 542 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
peptide 



Synthetic 



<400> 24 



lie Leu Leu His Thr Phe Ser He Lys Met Gin Leu Leu Arg Cys Phe 
1 5 io 



15 



Ser He Phe Ser Val He Ala Ser Val Leu Ala Ala Ser Leu Asp Lys 

20 25 



30 



Arg Ser He Tyr Arg Tyr Tyr Gly Leu Leu Lys Gly Gly Pro Gly Gly 
35 40 45 

Gly Ser Gly Gly Gly Gly Pro His Ser Leu Arg Tyr Phe Val Thr Alar 
50 55 eo 

Val Ser Arg Pro Gly Leu Gly Glu Pro Arg Tyr Met Glu Val Gly Tyr 
65 70 75 so 

Val Asp Asp Thr Glu Phe Val Arg Phe Asp Ser Asp Ala Glu Asn Pro 

85 90 95 

Arg Tyr Glu Pro Arg Ala Arg Trp Met Glu Gin Glu Gly Pro Glu Tyr 

100. 105 no 

Trp Glu Arg Glu Thr Gin Lys Ala Lys Gly Asn Glu Gin Ser Phe Arg 
115 120 125 

r 

Val Asp Leu Arg Thr Leu Leu Gly Tyr Tyr Asn Gin Ser Lys Gly Gly 
130 135 140 

Ser His Thr He Gin Val He Ser Gly Cys Glu Val Gly Ser Asp Gly 
145 150 155 160 

.Arg_Leu_Leu_Arg_Gly_Tyr_Gln-Gln-T>-r -Ala— Tyr-Asp -Gly-Cys-Asp Tyr- 

165 170 175 
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He Ala Leu Asn Glu Asp Leu Lys Thr Trp Thr Ala Ala Asp Met Ala 

180 185 



190 



Ala Leu lie Thr Lys His Lys Trp Glu Gin Ala Gly Glu Ala Glu Arg 
195 ~ - ~ 



200 



205 



Leu Arg Ala Tyr Leu Glu Gly Thr Cys Val Glu Trp Leu Arg Arg Tyr 



215 



220 



Leu Lys Asn Gly Asn Ala Thr Leu Leu Arg Thr Asp Ser Pro Lys Ala 
225 " " * 



230 



235 



240 



His Val Thr His His Ser Arg Pro Glu Asp Lys Val Thr Leu Arg Cys 

245 250 " 



255 



Trp Ala Leu Gly Phe Tyr Pro Ala Asp He Thr Leu Thr Trp Gin Leu 

260 265 



270 



Asn Gly Glu Glu Leu He Gin Asp Met Glu Leu Val Glu Thr Arg Pro 

275 280 



285 

Ala Gly Asp Gly Thr Phe Gin Lys Trp Ala Ser Val Val Val Pro Leu 
290 295 300 

Gly Lys Glu Gin Tyr Tyr Thr Cys His Val Tyr His Gin Gly Leu Pro 
305 310 315 320 

Glu Pro Leu Thr Leu Arg Trp Glu Gly Gly He Gly Ser Gly Gly Gly 

325 330 335 

Gly Ser Gly Gly Gly Gly Ser He Gin Lys Thr Pro Gin He Gin Val 

340 345 350 



Tyr ser Arg His Pro Pro Glu Asn Gly Lys Pro Asn He Leu Asn Cys 
355 360 365 

Tyr Val Thr Gin Phe His Pro Pro His He Glu He Gin Met Leu Lys 
370 375 380 

Asn Gly Lys Lys He Pro Lys Val Glu Met Ser Asp Met Ser Phe Ser 
385 390 395 4 00 

Lys Asp Trp Ser Phe Tyr He Leu Ala His Thr Glu Phe Thr Pro Thr 

405 410 415 

Glu Thr Asp Thr Tyr Ala Cys Arg Val Lys His Asp Ser Met Ala Glu 

420 425 430 

Pro Lys Thr Val Tyr Trp Asp Arg Asp Met Glu Gin Lys Leu lie Ser 
435 440 445 

Glu Glu Asp Leu His Met Gin Glu Leu Thr Thr He Cys Glu Gin He 
450 455 460 



Pro Ser Pro Thr Leu Glu Ser Thr Pro Tyr Ser Leu Ser Thr Thr Thr 
465 470 475 480 
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Xle Leu Ala Asn Gly Lys Ala Met Gin Gly Val Phe Glu 



485 



490 



Tyr Tyr Lys 
495 



Ser Val Thr Phe Val Ser Asn Cys Gly Ser His Pro Ser Thr Thr Ser 

500 505 



510 



Lys Gly ser Pro lie Asn Thr Gin Tyr Val Phe Lys Asp Asn Ser Ser 
51 5 520 



525 



Thr He Glu Gly Arg Tyr Pro Tyr Asp Val Pro Asp Tyr Ala 
530 535 * ^ 



540 



<210> 25 
<211> 1640 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
nucleotide 



<400> 25 

gaattctact 

ctgttattgc 

cagttcttaa 

atttcgtcac 

acgtggacga 

cgcgggcgcg 

ccaagggcaa 

agagcaaggg 

ggcgactcct 



acgaagacct 

gggagcaggc 

ggctccgcag 

cccatgtgac 

gcttctaccc 

acatggagct 

tggtggtgcc 

ctgagcccct 

gcggaggttc 

atgggaagcc 

tccaaatgct 

gcaaggactg 

catacgcctg 

gagacatgga 

tatgcgagca 

ctattttggc 

ttgtcagtaa 

agtatgtttt 

actacgctta 

<210> 26 



tcatacattt 
ttcagtttta 
gggtggacca 
cgccgtgtcc 
cacggagttc 
gtggatggag 
tgagcagagt 
cggctctcac 
ccgcgggtac 



gaaaacgtgg 

tggtgaagca 

atacctgaag 

ccatcacagc 

tgctgacatc 

tgtggagacc 

tcttgggaag 

caccctgaga 

aatccagaaa 

gaacatactg 

gaagaacggg 

gtctttctat 

cagagttaag 

acaaaagctt 

aatcccctca 

caacgggaag 

ttgcggttct 

taaggacaat 

gtaactcgag 



tcaattaaga 
gcagctagct 
ggtggaggtt 
cggcccggcc 
gtgcgcttcg 
caggaggggc 
ttccgagtgg 
actattcagg 
cagcagtacg 



tgcagttact 
tggataaaag 
caggaggtgg 
tcggggagcc 
acagcgacgc 
ccgagtattg 
acctgaggac 
tgatctctgg 
cctacgacgg 



tcgctgtttt 
agaacaatac 
aggcccacac 
ccggtacatg 
ggagaatccg 
ggagcgggag 
cctgctcggc 
ctgtgaagtg 
ctgcgattac 



tcaatatttt 60 
aaattctact 120 
tcgctgaggt 180 
gaagtcggct 240 
agatatgagc 300 
acacagaaag 360 
tactacaacc 420 

gggtccgacg 48 o 
atc gccctg a 540 



acggcggcgg 

gagagactca 

aacgggaacg 

agacctgaag 

accctgacct 

aggcctgcag 

gagcagtatt 

tgggagggtg 

acccctcaaa 

aactgctacg 

aaaaaaattc 

atcctggctc 

catgacagta 

atttctgaag 

ccaactttag 

gcaatgcaag 

cacccctcaa 

agctcgacga 



acatggcggc 

gggcctacct 

cgacgctgct 

ataaagtcac 

ggcagttgaa 

gggatggaac 

acacatgcca 

gaataggttc 

ttcaagtata 

taacacagtt 

ctaaagtaga 

acactgaatt 

tggccgagcc 

aagacttgca 

aatcgacgcc 

gagtttttga 

caactagcaa 

ttgaaggtag 



gctgatcacc 
ggagggcacg 
gcgcacagat 
cctgaggtgc 
tggggaggag 
cttccagaag 
tgtgtaccat 

aggtggcggt 
ctcacgccac 
ccacccgcct 
gatgtcagat 
cacccccact 
caagaccgtc 
tatgcaggaa 
gtactctttg 
atattacaaa 
aggcagcccc 
atacccatac 



aaacacaagt 600 
tgcgtggagt 660 
tccccaaagg 720 
tgggccctgg 780 
ctgatccagg 840 
tgggcatctg 900 
caggggctgc 960 
ggatcaggag 1020 
ccaccggaga 1080 
cacattgaaa 1140 
atgtccttca 1200 
gagactgata 1260 
tactgggatc 132 0 
ctgacaacta 1380 
tcaacgacta 1440 
tcagtaacgt 1500 
ataaacacac 1560 
gacgttccag 1620 

1640 



■<2X1>~~5"4"2 
<212> PRT 

<213> Artificial Sequence 
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<220> 



<223> Description of Artificial Sequence: Synthetic 
peptide 



<400> 26 



He Leu Leu His Thr Phe Ser lie Lys Met Gin Leu Leu Arg Cys Phe 

5 10 15 

Ser lie Phe Ser val lie Ala Ser Val Leu Ala Ala Ser Leu Asp Lys 

20 25 30 

Arg Glu Gin Tyr Lys Phe Tyr Ser Val Leu Lys Gly Gly Pro Gly Gly 
35 40 45 

Gly Ser Gly Gly Gly Gly Pro His Ser Leu Arg Tyr Phe Val Thr Ala 
50 55 go 

Val Ser Arg Pro Gly Leu Gly Glu Pro Arg Tyr Met Glu Val Gly Tyr 
65 70 75 go 

Val Asp Asp Thr Glu Phe Val Arg Phe Asp Ser Asp Ala Glu Asn Pro 

85 90 95 

Arg Tyr Glu Pro Arg Ala Arg Trp Met Glu Gin Glu Gly Pro Glu Tyr 

100 los 110 

Trp Glu Arg Glu Thr Gin Lys Ala Lys Gly Asn Glu Gin Ser Phe Arg 
115 120 125 

Val Asp Leu Arg Thr Leu Leu Gly Tyr Tyr Asn Gin Ser Lys Gly Gly 
130 135 140 



Sex His xhr lie Gin Val He Ser Gly Cys Glu Val Gly Ser Asp Gly 



145 



150 



155 



160 



Arg Leu Leu Arg Gly Tyr Gin Gin Tyr Ala Tyr Asp Gly Cys Asp Tyr 

165 170 



175 



He Ala Leu Asn Glu Asp Leu Lys Thr Trp Thr Ala Ala Asp Met Ala 

180 185 



190 

Ala Leu He Thr Lys His Lys Trp Glu Gin Ala Gly Glu Ala Glu Arg 
195 200 205 

Leu Arg Ala Tyr Leu Glu Gly Thr Cys Val Glu Trp Leu Arg Arg Tyr 
210 215 220 

Leu Lys Asn Gly Asn Ala Thr Leu Leu Arg Thr Asp Ser Pro Lys Ala 
225 230 . 235 ~ 240 

■ 
■ 

His Val Thr His His Ser Arg Pro Glu Asp Lys Val Thr Leu Arg Cys 

245 250 255 



Trp Ala Leu Gly Phe Tyr Pro Ala Asp He Thr Leu Thr Trp Gin Leu 

260 265 270 



12 



WO 02/46399 



PCT/US01/47817 



Asn Gly Glu Glu Leu lie Gin Asp Met Glu Leu Val Glu Thr Arg Pro 



280 



285 



Ala Gly Asp Gly Thr Phe Gin Lys Trp Ala Ser Val Val Val Pro Leu 



295 



300 



Gly Lys Glu Gin Tyr Tyr Thr Cys His Val Tyr His Gin Gly 



310 



315 



Leu Pro 
320 



Glu Pro Leu Thr Leu Arg Trp Glu Gly Gly He Gly Ser Gly Gly Gly 



330 



335 



Gly ser Gly Gly Gly Gly Ser lie Gin Lys Thr Pro Gin He Gin Val 

340 345 



350 



Tyr Ser Arg His Pro Pro Glu Asn Gly Lys Pro Asn lie Leu Asn Cys 
355 360 365 

Tyr Val Thr Gin Phe His Pro Pro His He Glu He Gin Met Leu Lys 



375 



380 



Asn Gly Lys Lys He Pro Lys Val Glu Met Ser Asp Met Ser Phe 

385 



395 



Ser 
400 



Lys Asp Trp Ser Phe Tyr He Leu Ala His Thr Glu Phe Thr Pro Thr 

405 410 



415 



Glu Thr Asp Thr Tyr Ala Cys Arg Val Lys His Asp Ser Met Ala Glu 

420 425 



430 



Pro Lys Thr Val Tyr Trp Asp Arg Asp Met Glu Gin Lys Leu He Ser 
435 440 445 



Glu Glu Asp Leu His Met Gin Glu Leu Thr Thr He Cys Glu Gin He 
450 455 46o 

Pro Ser Pro Thr Leu Glu Ser Thr Pro Tyr Ser Leu Ser Thr Thr Thr 
465 470 475 48O 

He Leu Ala Asn Gly Lys Ala Met Gin Gly Val Phe Glu Tyr Tyr Lys 

485 490 495 

Ser Val Thr Phe Val Ser Asn Cys Gly Ser His Pro Ser Thr Thr Ser 

500 505 510 

Lys Gly Ser Pro He Asn Thr Gin Tyr Val Phe Lys Asp Asn Ser Ser 
515 520 525 

Thr He Glu Gly Arg Tyr Pro Tyr Asp Val Pro Asp Tyr Ala 
530 535 540 



<210> 27 

— <2-l-l->— 3.-S4G 

<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Description of Artificial Sequence: Synthetic 
nucleotide 

<400> 27 

cStattac SSS? tCaa " aa9a tgo^ttwt tcgctgtttt tcaatatttt 60 
«SS£S ttcagtttta ffcagctagct tggataaaag gagcatoatc aattttgaaa 120 

llZcZlZ 2 9 T f" ggtggag£ftt «Wgt« aggcccacac tcgctgaggt 180 
atttcgtcac cgccgtgtcc cggcccggcc tcggggagcc ccggtacatg gaagtcggct 240 

ScS™ ! aCggagttG Stgcgcttcg acagcgacgo ggagaatccg agatatglgc 300 
tclTallll f ggatggag -WWBO ccgagtattg ggagcgggag acacagaaag 360 
ccaagggcaa tgagcagagt ttccgagtgg acctgaggac cctgctcggc tactacaacc 420 
agagcaaggg cggctctoac actattcagg tgatctctgg ctgtgaagcg gggtcc^cg tlo 
ggcgactcct ccgcgggtac cagcagtacg cctacgacgg ctgcgatcac alcgccctga tlo 
alitor. gaaaacgt ^ acggcggcgg acatggcggo gctgatcacc aaacacaagt 600 
gggagcaggc tggtgaagoa gagagactca gggcctacct ggagggcacg tgcgtggagt 660 
ggctccgcag atacctgaag aacgggaacg cgacgctgct gcgcacagat tccccaaagg 720 
cccatgtgac ccatcacagc agacctgaag ataaagtcac cctgaggtgc tgggccctgg 780 
gcttctaccc tgctgacatc accctgacct ggcagttgaa tggggaggag ctgatccagg 840 
acatggagct tgtggagacc aggcctgcag gggatggaac cttccagaag tgggcatctg 900 
tggtggtgcc tcttgggaag gagcagtatt acacatgcca tgtgtaccat caggggctgc 960 
ctgagcccct caccctgaga tgggagggtg gaataggttc aggtggcggt ggatcaggag 1020 
gcggaggttc aatccagaaa acccctcaaa ttcaagtata ctcacgccac ccaccggaga 1080 
atgggaagcc gaacatactg aactgctacg taacacagtt ccacccgcct caoattgaaa 1140 
tocaaatgct gaagaacggg aaaaaaattc ctaaagtaga gatgtoagat atgtccttca 1200 
goaaggactg gtctttctat atcotggctc acactgaatt cacccccact gagactgata 1260 
catacgcctg oagagttaag catgacagta tggccgagcc caagaccgtc tactgggatc 1320 
gagacatgga acaaaagctt atttctgaag aagacttgca tatgcaggaa ctgaoaacta 1380 
tatgcgagca aatcccctca ccaactttag aatcgacgco gtactctttg tcaacgaota 1440 
ctattttggc caacgggaag gcaatgcaag gagtttttga atattacaaa tcagtaacgt 1500 
ttgtcagtaa ttgcggttct cacccctcaa caactagcaa aggcagcccc ataaacacac 1560 
agtatgtttt taaggacaat agctcgacga ttgaaggtag ataoccatac gaogttcoag 1620 
^ctacgotta gtaactcga g - 



1640 



<210> 28 
<211> 541 
<212> PRT 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: Synthetic 
peptide 

<400> 28 

He Leu Leu His Thr Phe Ser He Lys Met Gin Leu Leu Arg Cys Phe 
1 5 10 15 

Ser He Phe Ser Val He Ala Ser Val Leu Ala Ala Ser Leu Asp Lys 

20 25 30 

Arg Ser He He Asn Phe Glu Lys Leu Leu Lys Gly Gly Pro Gly Gly 
35 40 45 



Gly-Ser-Gly-Gly-Gly-Gly-Pro-His-Ser-LW-A^Ty^Ph^Val^hr^Al^ 
50 55 60 



14 



WO 02/46399 



PCT/US01/47817 



Val Ser Arg Pro Gly Leu Gly Glu Pro Arg Tyr Met Glu Val Gly Tyr 

70 75 Qft 



Val Asp Asp Thr Glu Phe Val Arg Phe Asp Ser Asp Ala Glu 

85 90 



80 

Asn Pro 
95 



Arg Tyr Glu Pro Arg Ala Arg Trp Met Glu Gin Glu Gly Pro Glu Tyr 

1 Q Q -' — 



105 



110 



Trp Glu Arg Glu Thr Gin Lys Ala Lys Gly Asn Glu Gin Ser Phe Arg 
115 - ~ 3 



120 



125 



Val Asp Leu Arg Thr Leu Leu Gly Tyr Tyr Asn Gin Ser Lys Gly Gly 



135 



140 



Ser His Thr He Gin Val lie Ser Gly Cys Glu Val Gly Ser Asp Gly 



150 



155 



160 



Arg Leu Leu Arg Gly Tyr Gin Gin Tyr Ala Tyr Asp Gly Cys Asp Tyr 

165 170 



175 



He Ala Leu Asn Glu Asp Leu Lys Thr Trp Thr Ala Ala Asp Met Ala 

180 185 



190 



Ala Leu He Thr Lys His Lys Trp Glu Gin Ala Gly Glu Ala Glu Arq 
195 200 



205 



Leu Arg Ala Tyr Leu Glu Gly Thr Cys Val Glu Trp Leu Arg Arg Tyr 
210 215 220 

Leu Lys Asn Gly Asn Ala Thr Leu Leu Arg Thr Asp Ser Pro Lys Ala 
225 230 235 240 



His Val Thr His His Ser Arg Pro Glu Asp Lys Val Thr Leu Arg Cys 

245 250 



255 



Trp Ala Leu Gly Phe Tyr Pro Ala Asp He Thr Leu Thr Trp Gin Leu 

260 265 



270 



Asn Gly Glu Glu Leu He Gin Asp Met Glu Leu Val Glu Thr Arq Pro 
275 280 



285 

Ala Gly Asp Gly Thr Phe Gin Lys Trp Ala Ser Val Val Val Pro Leu 
290 295 300 

Gly Lys Glu Gin Tyr Tyr Thr Cys His Val Tyr His Gin Gly Leu Pro 
305 315 320 

Glu Pro Leu Thr Leu Arg Trp Glu Gly Gly He Gly Ser Gly Gly Gly 

325 330 335 

Gly Ser Gly Gly Gly Gly Ser He Gin Lys Thr Pro Gin lie Gin Val 

34 ° 345 350 



Tyr Ser Arg His Pro Pro Glu Asn Gly Lys Pro Asn He Leu Asn Cys 
355 360 . 365 
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Tyr Val Thr Gin Phe His Pro Pro His lie Glu He Gin Met Leu Lys 
370 -11-..- 



375 



380 



Asn Gly Lys Lys He Pro Lys Val Glu Met Ser Asp Met Ser Phe Ser 
385 390 395 4 oo 

Lys Asp Trp Ser Phe Tyr He Leu Ala His Thr Glu Phe Thr Pro Glu 

405 410 415 

Thr Asp Thr Tyr Ala Cys Arg Val Lys His Asp Ser Met Ala Glu Pro 

420 425 430 

Lys Thr Val Tyr Trp Asp Arg Asp Met Glu Gin Lys Leu He Ser Glu 
435 440 445 

Glu Asp Leu His Met Gin Glu Leu Thr Thr He Cys Glu Gin He Pro 
450 455 460 

Ser Pro Thr Leu Glu Ser Thr Pro Tyr Ser Leu Ser Thr Thr Thr He 
465 47 0 475 480 

Leu Ala Asn Gly Lys Ala Met Gin Gly Val Phe Glu Tyr Tyr Lys Ser 

485 490 495 

Val Thr Phe Val Ser Asn Cys Gly Ser His Pro Ser Thr Thr Ser Lys 

500 505 sio 

Gly Ser Pro He Asn Thr Gin Tyr Val Phe Lys Asp Asn Ser Ser Thr 
515 520 525 



lie Glu Gly Arg Tyr Pro Tyr Asp Val Pro Asp Tyr Ala 
530 535 540 



<210> 29 
<211> 1631 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: Synthetic 
nucleotide 



<400> 29 

atgcagttac ttcgctgttt ttcaatattt tctgttattg cttcagtttt agcacaggaa 60 
ctgacaacta tatgcgagca aatcccctca ccaactttag aatcgacgcc gtactctttg 120 
tcaacgacta ctattttggc caacgggaag gcaatgcaag gagtttttga atattacaaa 180 
tcagtaacgt ttgtcagtaa ttgcggttct cacccctcaa caactagcaa aggcagcccc 240 
ataaacacac agtatgtttt taaggacaat agctcgacga ttgaaggtag atacccatac 300 
gacgttccag actacgctct gcaggctagt ggtggtggtg gttcggtggt ggtggttctg 360 
gtggtggtgg ttctgctagc ggtggactta agggtggacc aggtggaggt tcaggaggtg 420 
gaggcccaca ctcgatgcgg tatttcgaga ccgcggtgtc ccggcgcggc ctcggggagc 480 
cccggtacat ctctgtcggc tatgtgaacg acaaggagtt cgtgcgcttc gacagcgacg 540 
cggagaatAc_gagatatgag_ccgagggcgc^ 

gggagcggat cacgcagatc gccaagggcc aggagcagtg gttccgagtg aacctgagga 660 
ccctgctcgg ctactacaac cagagcgcgg gcggcactca cacactccag tggatgtacg 720 
gctgtgacgt ggggtcggac gggcgcctcc tccgcgggta cgagcagttc gcctacgacg 780 
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gctgcgatta 
cgatgatcac 
tggagggcga 
tgcgcacaga 
ccctgaggtg 

at ggggagga 

ccttccagaa 

gtgtgtacca 

ca ggtggcgg 

actcacgcca 

tccacccgcc 

agatgtcaga 

tcacccccac 

ccaagaccgt 

aatagctcga 



catcgccctg 
ccgacgcaag 
gtgcgtggag 
ttccccaaag 

ctgggccctg 

gctgacccag 
gtgggcatct 
tgaggggctg 
tggatcagga 
cccaccggag 
tcacattgaa 
tatgtccttc 
tgagactgat 
ctactgggat 

g 



aacgaagacc 

tgggagcagg 

tggctccaca 

gcacatgtga 

ggcttctacc 

gacatggagc 

STtggtggtgc 

ccccatcccc 

ggcggaggtt 

aatgggaagc 

atccaaatgc 

agcaaggact 

acatacgcct 

cgagacatgg 



tgaaaacgtg 

ctggtgctgc 

gatacctgaa 

cctatcaccc 

ctgctgacat 

ttgtggagac 

ctcttgggaa 

tcaccctgag 

caatccagaa 

cgaacatact 

tgaagaacgg 

ggtctttcta 

gcagagttaa 

aacaaaagct 



gacgttcgcg 
agagtattac 
gaacgggaat 
cagatctaaa 
caccctgacc 
caggcctgca 
ggagcagaat 
atgggagggt 
aacccctcaa 
gaactgctac 
gaaaaaaatt 
tatcctggct 
gcatgacagt 
tatttctgaa 



gacatgtcgt 840 
agggcctacc 900 
gctacgctgc 960 
ggtgaagtca 1020 
tggcagttga 1080 
ggggatggaa 1140 
tacacatgcc 1200 
ggaataggtt 1260 
attcaagtat 1320 
gtaacacagt 1380 
cctaaagtag 1440 
cacactgaat 1500 
atggccgagc 1560 
gaagacttgt 1620 
1631 



<210> 30 
<211> 540 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> peptide^ 011 ° f ArtifiGial Se ^uce: Synthetic 



<400> 30 



Met Gin Leu Leu Arg Cys Phe Ser lie Phe Ser Val He Ala 
1 5 10 



Ser Val 
15 



Leu Ala Gin Glu Leu Thr Thr lie rv« ri™ r-i„ n 

inr lie cys Glu Gin He Pro Ser Pro Thr 

25 30 



Leu Glu Ser Thr Pro Tyr Sor Leu Ser Thr Thr T hr lie Leu Ala Asn 
35 4Q ^ 



Gly Lys Ala Met Gin Gly Val Phe Glu Tyr Tyr Lys Ser Val Thr Phe 

55 g 0 



Gly Ser Pro 
80 



Val Ser Asn Cys Gly Ser His Pro Ser Thr Thr Ser Lys 
65 7 ° 75 

lie Asn Thr Gin Tyr Val Phe Lys Asp Asn Ser Ser Thr lie Glu Gly 

Arg Tyr Pro Tyr Asp Val Pro Asp Tyr Ala Leu Gin Ala Ser Gly Gly 

100 105 110 



Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Ala Ser Gly 
115 120 125 



Gly Leu Lys Gly Gly Pro Gly Gly Gly Ser Gly Gly Gly Gly Pro Hi 



135 



His 



X40 



Ser.mt^ g ^yr-^e -oin-^-^-V^-S^^^ r g Gly Leu 01^1^ 



150 



155 



160 
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Pro Arg Tyr He Ser Val Gly Tyr Val Asn Asp Lys Glu Phe Val Arg 

165 170 



175 



Phe Asp ser Asp Ala Glu Asn Pro Arg Tyr Glu Pro Arg Ala Pro Trp 

180 ~ v 



185 



190 



Met Glu Gin Glu Gly Pro Glu Tyr Trp Glu Arg lie Thr Gin lie Ala 
195 200 



205 



Lys Gly Gin Glu Gin Trp Phe Arg Val Asn Leu Arg Thr Leu Leu Gly 



215 



220 



Tyr Tyr Asn Gin Ser Ala Gly Gly Thr His Thr Leu Gin Trp Met Tyr 
225 



230 



235 



240 



Gly Cys Asp Val Gly Ser Asp Gly Arg Leu Leu Arg Gly Tyr Glu Gin 

245 250 



255 



Phe Ala Tyr Asp Gly Cys Asp Tyr He Ala Leu Asn Glu Asp Leu Lys 

260 265 



270 



Thr Trp Thr Phe Ala Asp Met Ser Ser Met He Thr Arg Arg Lys Trp 
275 280 



285 



Glu Gin Ala Gly Ala Ala Glu Tyr Tyr Arg Ala Tyr Leu Glu Gly Glu 
290 295 300 

Cys Val Glu Trp Leu His Arg Tyr Leu Lys Asn Gly Asn Ala Thr Leu 
305 310 315 320 

Leu Arg Thr Asp Ser Pro Lys Ala His Val Thr Tyr His Pro Arg Ser 

325 330 335 



Lys Gly Glu Val Thr Leu Arg Cys Trp Ala Leu Gly Phe Tyr Pro Ala 

340 345 350 

Asp He Thr Leu Thr Trp Gin Leu Asn Gly Glu Glu Leu Thr Gin Asp 
355 360 365 

Met Glu Leu Val Glu Thr Arg Pro Ala Gly Asp Gly Thr Phe Gin Lys 
3 ™ 375 380 

Trp Ala Ser Val Val Val Pro Leu Gly Lys Glu Gin Asn Tyr Thr Cys 
385 390 395 400 

Arg Val Tyr His Glu Gly Leu Pro His Pro Leu Thr Leu Arg Trp Glu 

405 410 415 

Gly Gly He Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser He 

420 425 430 

Gin Lys Thr Pro Gin He Gin Val Tyr Ser Arg His Pro Pro Glu Asn 
435 440 445 



Gly Lys Pro Asn He Leu Asn Cys Tyr Val Thr Gin Phe His Pro Pro 
450 455 460 
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His lie Glu lie Gin Met Leu Lys Asn Gly Lys Lys He Pro Lys Val 
5 470 475 4eo 



Leu 



Glu Met Ser Asp Met Ser Phe Ser Lys Asp Trp Ser Phe Tyr He 

485 490 495 

Ala His Thr Glu Phe Thr Pro Thr Glu Thr Asp Thr Tyr Ala Cys Arg 



500 505 



510 



Val Lys His Asp Ser Met Ala Glu Pro Lys Thr Val Tyr Trp Asp Arg 
515 520 5 25 

Asp Met Glu Gin Lys Leu He Ser Glu Glu Asp Leu 
530 535 54 0 



<210> 31 
<211> 832 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
nucleotide 



<400> 31 

gaattctact tcatacattt tcaattaaga tgcagttact tcgctgtttt tcaatatttt 60 
ctgttattgc ttcagtttta gcagctagct tggataaaag agaacaatac aaattctact 120 
cagttcttaa gggtggacca ggtggaggtt caggaggtgg aggcccacac tcgctgaggt 180 
atttcgtcac cgccgtgtcc cggcccggcc tcggggagcc ccggtacatg gaagtcggct 240 
acgtggacga cacggagttc gtgcgcttcg acagcgacgc ggagaatccg agatatgagc 300 
cgcgggcgcg gtggatggag caggaggggc ccgagtattg ggagcgggag acacagaaag 360 
ccaagggcaa tgagcagagt . ttccg a gtgg acctgaggac cct g ct cg gc tactacaacc 420 
agagcaaggg cggctctcac actattcagg tgatctctgg ctgtgaagtg gggtccgacg 480 
ggcgactcct ccgcgggtac cagcagtacg cctacgacgg ctgcgattac atcgccctga 540 
acgaagacct gaaaacgtgg acggcggcgg acatggcggc gctgatcacc aaacacaagt 600 
gggagcaggc tggtgaagca gagagactca gggcctacct ggagggcacg tgcgtggaga 660 
ggctccgcag atacctgaag aacgggaacg cgacgctgct gcgcacagat tccccaaagg 720 
cccatgtgac ccatcacagc agacctgaag ataaagtcac cctgaggtgc tgggccctgg 780 
gcttctaccc tgctgacatc accctgacct ggcagttgaa tggggaggag ct 832 

<210> 32 
<211> 542 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
peptide 

■ 

<400> 32 

He Leu Leu His Thr Phe Ser He Lys Met Gin Leu Leu Arg Cys Phe 
1 5 io 15 



Ser He Phe Ser Val He Ala Ser Val Leu Ala Ala Ser Leu Asp Lys 

20 25 30 
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Arg Glu Gin Tyr Lys Phe Tyr Ser Val Leu Lys Gly Gly Pro Gly Gly 
35 40 



45 

Gly Ser Gly Gly Gly Gly Pro His Ser Leu Arg Tyr Phe Val Thr Ala 
50 55 60 

Val Ser Arg Pro Gly Leu Gly Glu Pro Arg Tyr Met Glu Val Gly Tyr 
65 70 75 80 

Val Asp Asp Thr Glu Phe Val Arg Phe Asp Ser Asp Ala Glu Asn Pro 

85 90 95 

Arg Tyr Glu Pro Arg Ala Arg Trp Met Glu Gin Glu Gly Pro Glu Tyr 

100 105 no 

Trp Glu Arg Glu Thr Gin Lys Ala Lys Gly Asn Glu Gin Ser Phe Arg 
115 120 125 

Val Asp Leu Arg Thr Leu Leu Gly Tyr Tyr Asn Gin Ser Lys Gly Gly 
130 135 140 



Ser His Thr He Gin Val He Ser Gly Cys Glu Val Gly Ser Asp Gly 
145 150 155 



160 



Arg Leu Leu Arg Gly Tyr Gin Gin Tyr Ala Tyr Asp Gly Cys Asp Tyr 

165 170 175 

He Ala Leu Asn Glu Asp Leu Lys Thr Trp Thr Ala Ala Asp Met Ala 

180 185 190 

Ala Leu He Thr Lys His Lys Trp Glu Gin Ala Gly Glu Ala Glu Arg 
195 200 205 



Leu Arg Ala Tyr Leu Glu Gly Thr Cys Val Glu Arg Leu Arg Arg Tyr 
210 215 220 

Leu Lys Asn Gly Asn Ala Thr Leu Leu Arg Thr Asp Ser Pro Lys Ala 
225 230 235 240 

His Val Thr His His Ser Arg Pro Glu Asp Lys Val Thr Leu Arg Cys 

245 250 255 

Trp Ala Leu Gly Phe Tyr Pro Ala Asp He Thr Leu Thr Trp Gin Leu 

260 265 270 

Asn Gly Glu Glu Leu He Gin Asp Met Glu Leu val Glu Thr Arg Pro 
275 280 . 285 

Ala Gly Asp Gly Thr Phe Gin Lys Trp Ala Ser Val Val Val Pro Leu 
290 295 300 



Gly Lys Glu Gin Tyr Tyr Thr Cys His Val Tyr His Gin Gly Leu Pro 
305 310 315 " 320 



Glu Pro Leu Thr Leu Arg Trp Glu Gly Gly He Gly Ser Gly Gly Gly 

325 330 335 
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Gly Ser Gly Gly Gly Gly Ser He Gin Lys Thr Pro Gin lie 



340 



345 



Gin Val 



350 



Tyr Ser Arg His Pro Pro Glu Asn Gly Lys Pro Asn He Leu Asn Cys 
355 360 



365 

Tyr Val Thr Gin Phe His Pro Pro His lie Glu lie Gin Met Leu Lys 
370 375 380 Y 

Asn Gly Lys Lys He Pro Lys Val Glu Met Ser Asp Met Ser Phe Ser 
85 390 395 400 

Lys Asp Trp Ser Phe Asn He Leu Ala His Thr Glu Phe Thr Pro Thr 

405 410 415 

Glu Thr Asp Thr Tyr Ala Cys Arg Val Lys His Asp Ser Met Ala Glu 

420 425 43Q 

Pro Lys Thr Val Tyr Trp Asp Arg Asp Met Glu Gin Arg Leu He Ser 
435 440 445 

Glu Glu Asp Leu His Met Gin Glu Leu Thr Thr He Cys Glu Gin He 
450 455 460 



Pro Ser Pro Thr Leu Glu Ser Thr Pro Tyr Ser Leu Ser Thr Thr Thr 
465 47 ° 475 480 

He Leu Ala Asn Gly Lys Ala Met Gin Gly Val Phe Glu Tyr Tyr Lys 

485 490 495 

Ser Val Thr Phe Val Ser Asn Cys Gly Ser His Pro Ser Thr Thr Ser 

500 505 510 



Lys Gly Ser Pro He Asn Thr Gin Tyr Val Phe Lys Asp Asn Ser Ser 
515 520 525 

Thr He Glu Gly Arg Tyr Pro Tyr Asp Val Pro Asp Tyr Ala 
530 535 540 



<210> 33 
<211> 720 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
nucleotide 

<400> 33 

gaattctact tcatacattt tcaattaaga tgcagttact tcgctgtttt tcaatatttt 60 
ctgttattgc ttcagtttta gcagctagct tggataaaag agaacaatac aaattctact 120 
cagttcttaa gggtggacca ggtggaggtt caggaggtgg aggcccacac tcgctgaggt 180 
att-tcgt-cac-cgccgtgtcc-cggcccggcc-tcggggagcc^cggtacatg gaagtcggct 240 
acgtggacga cacggagttc gtgcgcttcg acagcgacgc ggagaatccg agatatgagc 300 
cgcgggcgcg gtggatggag caggaggggc ccgagtattg ggagcgggag acacagaaag 360 
ccaagggcaa tgagcagagt ttccgagtgg acctgaggac cctgctcggc tactacaacc 420 
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agagcaaggg cggctctcac actattcagg tgatctctgg ctgtgaagtg gggtccgacg 480 
ggcgactcct ccgcgggtac cagcagtacg cctacgacgg ctgcgattac atcgccctga 540 
acgaagacct gaaaacgtgg acggcggcgg acatggcggc gctgatcacc aaacacaagt 600 
gggagcaggc tggtgaagca gagagactca gggcctacct ggagggcacg tgcgtggagc 660 
ggctccgcag atacctgaag aacgggaacg cgacgctgct gcgcacagat tccccaaagg 720 

<210> 34 
<211> 542 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description. of Artificial Sequence: Synthetic 
peptide 

<400> 34 

lie Leu Leu His Thr Phe Ser lie Lys Met Gin Leu Leu Arg Cys Phe 
1 5 io X5 

Ser He Phe Ser Val lie Ala Ser Val Leu Ala Ala Ser Leu Asp Lys 

20 25 30 

Arg Glu Gin Tyr Lys Phe Tyr Ser Val Leu Lys Gly Gly Pro Gly Gly 
35 40 " 45 

Gly Ser Gly Gly Gly Gly Pro His Ser Leu Arg Tyr Phe Val Thr Ala 
50 55 60 

Val Ser Arg Pro Gly Leu Gly Glu Pro Arg Tyr Met Glu Val Gly Tyr 
6S 70 75 80 

Val Asp Asp Thr Glu Phe Val Arg Phe Asp Ser Asp Ala Glu Asn Pro 

as &o £5 



Arg Tyr Glu Pro Arg Ala Arg Trp Met Glu Gin Glu Gly Pro Glu Tyr 

100 105 no 

Trp Glu Arg Glu Thr Gin Lys Ala Lys Gly Asn Glu Gin Ser Phe Arg 
115 120 125 

Val Asp Leu Arg Thr Leu Leu Gly Tyr Tyr Asn Gin Ser Lys Gly Gly 
130 135 140 

Ser His Thr He Gin Val He Ser Gly Cys Glu Val Gly Ser Asp Gly 
145 150 155 160 

Arg Leu Leu Arg Gly Tyr Gin Gin Tyr Ala Tyr Asp Gly Cys Asp Tyr 

165 170 175 

He Ala Leu Asn Glu Asp Leu Lys Thr Trp Thr Ala Ala Asp Met Ala 

180 185 190 

Ala Leu He Thr Lys His Lys Trp Glu Gin Ala Gly Glu Ala Glu Arg 

195 200 205 



Leu Arg Ala Tyr Leu Giu Gly Thr Cys Val Glu Arg Leu Arg Arg Tyr 
210 215 220 
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Leu Lys Asn Gly Asn Ala Thr Leu Leu Arg Thr Asp Ser Pro Lys Ala 



230 



235 



240 



His Val Thr His His Ser Arg Pro Glu Asp Lys Val Thr Leu Arg Cys 

245 250 



255 



Trp Ala Leu Gly Phe Tyr Pro Ala Asp He Thr Leu Thr Trp Gin Leu 

260 265 



270 

Asn Gly Glu Glu Leu He Gin Asp Met Glu Leu Val Glu Thr Arg Pro 
275 280 285 

Ala Gly Asp Gly Thr Phe Gin Lys Trp Ala Ser Val Val Val Pro Leu 
290 295 300 

Gly Lys Glu Gin Tyr Tyr Thr Cys His Val Tyr His Gin Gly Leu Pro 
305 310 315 3 2 o 

Glu Pro Leu Thr Leu Arg Trp Glu Gly Gly He Gly Ser Gly Gly Gly 

325 330 335 

Gly Ser Gly Gly Gly Gly Ser He Gin Lys Thr Pro Gin He Gin Val 

340 345 350 

Tyr Ser Arg His Pro Pro Glu Asn Gly Lys Pro Asn He Leu Asn Cys 
355 360 365 

Tyr Val Thr Gin Phe His Pro Pro His He Glu He Gin Met Leu Lys 
3 ?0 375 380 

Asn Gly Lys Lys He Pro Lys Val Glu Met Ser Asp Met Ser Phe Ser 
385 390 395 400 



Lys Asp Trp Ser Phe Tyr He Leu Ala His Thr Glu Phe Thr Pro Thr 

405 410 415 

Glu Thr Asp Thr Tyr Ala Cys Arg Val Lys His Asp Ser Met Ala Glu 

420 425 430 

Pro Lys Thr Val Tyr Trp Asp Arg Asp Met Glu Gin Lys Leu He Ser 
435 440 445 

Glu Glu Asp Leu His Met Gin Glu Leu Thr Thr He Cys Glu Gin He 
450 455 460 

Pro Ser Pro Thr Leu Glu Ser Thr Pro Tyr Ser Leu Ser Thr Thr Thr 
465 470 475 480 

He Leu Ala Asn Gly Lys Ala Met Gin Gly Val Phe Glu Tyr Tyr Lys 

485 490 495 

Ser Val Thr Phe Val Ser Asn Cys Gly Ser His Pro Ser Thr Thr Ser 

500 505 510 



Lys Gly Ser Pro He Asn Thr Gin Tyr Val Phe Lys Asp Asn Ser Ser 
515 520 525 
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Thr He Glu Gly Arg Tyr Pro Tyr Asp Val Pro Asp Tyr Ala 
530 535 54Q 

<210> 35 
<211> 1091 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
nucleotide 



<400> 35 

gaattctact tcatacattt tcaattaaga tgcagttact tcgctgtttt tcaatatttt 60 
ctgttattgc ttcagtttta gcagctagct tggataaaag agaacaatac aaattctact 120 
cagttcttaa gggtggacca ggtggaggtt caggaggtgg aggcccacac tcgctgaggt 180 
atttcgtcac cgccgtgtcc cggcccggcc tcggggagcc ccggtacatg gaagtcggct 240 
acgtggacga cacggagttc gtgcgcttca acagcgacgc ggagaatccg agatatgagc 300 
cgcgggcgcg gtggatggag caggaggggc ccgagtattg ggagcgggag acacagaaag 360 
ccaagggcaa tgagcagagt ttccgagtgg acctgaggac cctgctcggc tactacaacc 420 
agagcaaggg cggctctcac actattcagg tgatctctgg ctgtgaagtg gggtccgacg 480 
ggcgactcct ccgcgggtac cagcagtacg cctacgacgg ctgcgattac atcgccctga 540 
acgaagacct gaaaacgtgg acggcggcgg acatggcggc gctgatcacc aaacacaagt 600 
gggagcaggc tggtgaagca gagagactca gggcctacct ggagggcacg tgcgtggaga 660 
ggctccgcag atacctgaag aacgggaacg cgacgctgct gcgcacagat tccccaaagg 720 
cccatgtgac ccatcacagc agacctgaag ataaagtcac cctgaggtgc tgggccctgg 780 
gcttctaccc tgctgacatc accctgacct ggcagttgaa tggggaggag ctgatccagg 840 
acatggagct tgtggagacc aggcctgcag gggatggaac cttccagaag tgggcatctg 900 
tggtggtgcc tcttgggaag gagcagtatt acacatgcca tgtgtaccat caggggctgc 960 
ctgagcccct caccctgaga tgggagggtg gaataggttc aggtggcggt ggatcaggag 1020 
gcggaggttc aatccagaaa acccctcaaa ttcaagtata ctcacgccac ccaccggaga 1080 
atgggaagcc g w ~ ^ inQ _ 



<210> 36 
<211> 542 
<212> PRT 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: Synthetic 
peptide 



<400> 36 

He Leu Leu His Thr Phe Ser He Lys Met Gin Leu Leu Arg Cys Phe 
15 io i 5 

Ser He Phe Ser Val He Ala Ser Val Leu Ala Ala Ser Leu Asp Lys 

20 25 30 

Arg Glu Gin Tyr Lys Phe Tyr Ser Val Leu Lys Gly Gly Pro Gly Gly 
35 40 45 

Gly Ser Gly Gly Gly Gly Pro His Ser Leu Arg Tyr Phe Val Thr Ala 
50 55 60 



Val Ser Arg Pro Gly Leu Gly Glu Pro Arg Tyr Met Glu Val Gly Tyr 
65 70 75 80 
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Val Asp Asp Thr Glu Phe Val Arg Phe Asn Ser Asp Ala Glu 

85 30 



Asn Pro 
95 



Arg Tyr Glu Pro Arg Ala Arg Trp Met Glu Gin Glu Gly Pro Glu 



100 



105 



Tyr 



110 



Trp Glu Arg Glu Thr Gin Lys Ala Lys Gly Asn Glu Gin Ser Phe Arg 
115 -~- 3 



120 



125 



Val Asp Leu Arg Thr Leu Leu Gly Tyr Tyr Asn Gin Ser Lys Gly Gly 
130 ■* ~* *■* 



135 



140 



Ser His Thr He Gin Val He Ser Gly Cys Glu Val Gly Ser Asp Gly 



150 



155 



160 



Arg Leu Leu Arg Gly Tyr Gin Gin Tyr Ala Tyr Asp Gly Cys Asp Tyr 

165 170 



175 



He Ala Leu Asn Glu Asp Leu Lys Thr Trp Thr Ala Ala Asp Met Ala 

180 i 85 



190 



Ala Leu He Thr Lys His Lys Trp Glu Gin Ala Gly Glu Ala Glu Arcx 
195 200 "~ 



205 

Leu Arg Ala Tyr Leu Glu Gly Thr Cys Val Glu Arg Leu Arg Arg Tyr 
210 215 220 

Leu Lys Asn Gly Asn Ala Thr Leu Leu Arg Thr Asp Ser Pro Lys Ala 
225 230 235 2 4o 

His Val Thr His His Ser Arg Pro Glu Asp Lys Val Thr Leu Arg Cys 

245 250 255 



Trp Ala Leu Gly Phe Tyr Pro Ala Asp He Thr Leu Thr Trp Gin Leu 

260 265 270 

Asn Gly Glu Glu Leu He Gin Asp Met Glu Leu Val Glu Thr Arg Pro 
275 280 285 

Ala Gly Asp Gly Thr Phe Gin Lys Trp Ala Ser Val Val Val Pro Leu 
290 295 300 

Gly Lys Glu Gin Tyr Tyr Thr Cys His Val Tyr His Gin Gly Leu Pro 
305 310 315 320 

Glu Pro Leu Thr Leu Arg Trp Glu Gly Gly He Gly Ser Gly Gly Gly 

325 330 335 

Gly Ser Gly Gly Gly Gly Ser He Gin Lys Thr Pro Gin He Gin Val 

340 345 350 

Tyr Ser Arg His Pro Pro Glu Asn Gly Lys Pro Asn He Leu Asn Cys 
35 5 360 365 



Tyr Val Thr Gin Phe His Pro Pro His He Glu He Gin Met Leu Lys 
370 375 380 



25 



WO 02/46399 



PCT/US01/47817 



Asn Gly Lys Lys He Pro Lys Val Glu Met Ser Asp Met Ser Phe Ser 
385 390 395 400 

Lys Asp Trp Ser Phe Asn He Leu Ala His Thr Glu Phe Thr Pro Thr 

405 410 415 

Glu Thr Asp Thr Tyr Ala Cys Arg Val Lys His Asp Ser Met Ala Glu 

420 425 430 

Pro Lys Thr Val Tyr Trp Asp Arg Asp Met Glu Gin Arg Leu He Ser 
435 440 445 

Glu Glu Asp Leu His Met Gin Glu Leu Thr Thr He Cys Glu Gin He 
450 455 460 

Pro Ser Pro Thr Leu Glu Ser Thr Pro Tyr Ser Leu Ser Thr Thr Thr 
465 470 475 480 

He Leu Ala Asn Gly Lys Ala Met Gin Gly Val Phe Glu Tyr Tyr Lys 

485 490 495 

Ser Val Thr Phe Val Ser Asn Cys Gly Ser His Pro Ser Thr Thr Ser 

500 505 510 

Lys Gly Ser Pro He Asn Thr Gin Tyr Val Phe Lys Asp Asn Ser Ser 
515 520 525 



Thr He Glu Gly Arg Tyr Pro Tyr Asp Val Pro Asp Tyr Ala 
530 535 540 



<210> 37 

<211> 9 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
peptide 

<400> 37 

His Tyr Ser Pro Phe Arg Gin Leu Ala 
1 5 
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